Are you a hands-on Computer Vision and Audio Scientist with a solid background in Machine Learning who loves solving challenging real-world audio-visual learning problems and is eager to design solutions that will help us revolutionize content creation for films and TV?
Do you want to help revolutionize the way that Film and TV are experienced across the globe? Do you dream of working with the world's best Technologists, Visual Effects Editors, and AI Scientists? Then look no further!
DS Group have partnered with a company with the first AI-based technology that can visually translate a film or TV show into any language! All whilst preserving the original actor's performances, so you can enjoy it without annoying subtitles or poor dubbing! Their recent launch has attracted interest from several global streaming services & film studios, cementing their position as pioneers in a multi-billion-dollar industry.
To take full advantage of this position, they are expanding their Research team in LA with multiple Research Engineers to contribute to novel AI-based audio technology deployed at scale to turn science into unique business products.
Requirements
To succeed, you should have:
Demonstrable 3+ years research experience (academic or industry) with Computer Vision, Signal Processing, and Deep Learning in at least four of the following:
Neural networks and transfer learning
Generative/adversarial models esp. for visual speech synthesis
Multi-modal fusion and cross-domain adaption techniques for audio-visual learning
Recurrent and visual attention models for audio-video processing
Audio-driven face animation
Speech modeling and synthesis esp. with deep networks
Statistical sgnal processing techniques mainly for audio data
Scholarly work in computer vision, signal processing, and machine learning venues.
Good background in statistical methods and numerical optimization
Excellent coding skills in Python
Proficiency with DL frameworks such as PyTorch, Tensorflow, or Keras
Good communication skills to collaborate in a team with researchers, ML engineers, and VFX artists
MS or Ph.D. in Computer Vision, Speech Recognition and Synthesis, Machine Learning, or related fields
You’ll shine if you have experience with:
Multi-modal audio separation
Developing tools or solutions at scale for multi-modal or audio-driven visual systems in AR/VR or VFX
C++ and CUDA
Cloud platforms such as GCP and AWS
Virtual environments such as Anaconda
Audio and post-production tools such as Avid, Resolve, or After Effects
Benefits
What you'll get from The Company:
Autonomy - you own the work that you do from start to finish, and you'll have the opportunity to influence research ideas and independently implement and evaluate them in modern deep learning and company tech stack at scale.
Career growth in science - you’ll be up to speed with major scientific advances, contribute ideas to scholarly articles and patents, and participate at major conferences in AI-related fields.
Learning & development - you'll be working with the world's best Technologists, Visual Effects Editors & AI Scientists to push the boundaries of what is possible.
Inclusive culture - collaboration is essential, everyone's opinion and input are genuinely valued.
Be part of something BIG - they are changing the Film & TV industry for the better, breaking down language barriers, and bringing people closer together.
Working for this Company:
Enjoy their flexible working environment and hybrid office model.
Benefit from their shared success - all new joiners are given stock options.
Competitive salary
Celebrate together - work across teams to solve challenging problems and push the boundaries of video content creation.
Networking in the film industry
Medical / dental / vision insurance
Basic 401(k) plan
Corporate laptop