Location: | Guildford |
---|---|
Salary: | £36,024 to £39,347 per annum |
Hours: | Full Time |
Contract Type: | Fixed-Term/Contract |
Placed On: | 1st October 2024 |
---|---|
Closes: | 27th October 2024 |
Job Ref: | 047124 |
The University of Surrey is a global community of ideas and people, dedicated to life-changing education and research.
We are ambitious and have a bold vision of what we want to achieve - shaping ourselves into one of the best universities in the world, which we are achieving through the talents and endeavour of every employee.
Our culture empowers people to achieve this aim and to collectively, and individually, make a real difference.
The role
Applications are invited for a Research Fellow (RF) position for 10 months within the Centre for Vision Speech and Signal Processing (CVSSP), University of Surrey, UK, to work on generative AI for social music game design.
The post is funded by Research England and Surrey County Council, under the GAIN program. The project is a collaboration with industry partner Record Games. The focus will be to develop generative machine learning models and signal processing algorithms for music sound generation. This work is built on the recent contributions of the CVSSP audio team in generative AI models for audio generation, such as AudioLDM and AudioLDM2, with a focus on exploring the performance of using such models for music instrument sound generation and style transfer (e.g. from singing voice to trumpet sound).
The post-holder will be based in CVSSP, and work under the direction of the Principal Investigator Prof Wenwu Wang, in collaboration with the industrial partner.
CVSSP is an International Centre of Excellence for research in Audio-Visual Machine Perception and AI, with over 180 researchers. The Centre has state-of-the-art audio and video capture and analysis facilities supporting research in real-time video and audio processing and visualisation. CVSSP has a compute facility with 1000 CPUs, 200 GPUs and >2PB of high-speed secure storage.
About you
The post-holder is expected to have a PhD degree (or equivalent) in generative AI, acoustic signal processing, cross-modal processing among audio and text, machine learning, or a related area in electronic engineering, applied mathematics, computer science, and statistics. The post-holder is expected to have strong analytical skills and programming skills such as Python, C/C++, or Matlab. Preference will be given to those who have experience on generative AI models, audio generation, cross modal translations (such as, text to audio), but candidates who have experience in machine learning and audio signal processing are welcome to apply.
How to apply
Please apply with your CV and cover letter with your application on the University website.
For informal inquiries, please contact Prof Wenwu Wang (Email: w.wang@surrey.ac.uk; Web: https://personalpages.surrey.ac.uk/w.wang/).
The University of Surrey reserves the right to close this vacancy early based on Volume and Calibre of applications.
Further details
For more information and to apply online, please download the further details and click on the 'apply online' button above.
Type / Role:
Subject Area(s):
Location(s):