Sr Staff R&D Engineer
Disney
2 - 5 years
California
Posted: 06/04/2026
Job Description
Job Posting Title:
Sr Staff R&D EngineerReq ID:
10127968Job Description:
The Skywalker Sound Development Group is seeking a highly accomplished Sr Staff R&D Engineer (AI/ML) to lead the development of transformative audio intelligence technologies for global media production. This senior-level role is central to advancing our next-generation soundtrack platform, with a focus on speech processing, style transfer, upmixing, source separation, and generative audio synthesis.
You will architect, build, and optimize cutting-edge machine learning systems at scale—leveraging foundational models, neural vocoders, latent diffusion models, and advanced retraining workflows. As a core member of our applied R&D team, you will contribute to technical direction, collaborate across product and engineering, and deliver production-ready solutions that integrate seamlessly into creative and operational workflows for elite content creators worldwide.
This role is considered Hybrid, which means the employee will work onsite in our Nicasio, CA office and occasionally from home.
What You’ll Do
Lead the research, design, and implementation of state-of-the-art machine learning algorithms for speech processing, voice transfer, source separation, and upmixing in media post-production environments.
Drive the architecture and deployment of scalable model training pipelines using PyTorch and distributed computing frameworks.
Develop novel generative audio models, including latent diffusion, flow-based models, variational autoencoders, and neural vocoders, optimized for professional soundtrack production.
Own end-to-end model lifecycle management: pretraining, fine-tuning, validation, inference optimization, and CI/CD integration.
Guide the development of personalized model adaptation workflows to support per-user tuning, cross-project continuity, and flexible deployment.
Collaborate with product, platform, and engineering leads to define integration strategies within a secure, cloud-optimized SaaS environment.
Stay at the forefront of generative audio, multi-modal modeling, and self-supervised learning—translating emerging research into applied innovation.
Contribute to internal tooling and infrastructure that improves iteration speed, reproducibility, and explainability of deployed models.
Mentor junior researchers and engineers, and contribute to a culture of rigorous experimentation, collaboration, and continuous improvement.
What We’re Looking For
MSc or PhD in Computer Science, Electrical Engineering, Applied Math, or a related field with a focus on AI/ML and mult-imodal signal processing.
5 years of professional experience in applied ML, with a deep focus on audio-centric AI/ML research and deployment.
Expertise in building and scaling models using PyTorch, with fluency in training, fine-tuning, and inference for deep neural networks.
Demonstrated experience developing generative models such as VAE, GAN, diffusion models, or neural vocoders (e.g., HiFi-GAN, WaveNet).
Deep understanding of audio-specific ML domains, including source separation, speech enhancement, music processing, and cross-modal tasks.
Experience with MLOps tooling (e.g., Weights & Biases, MLflow, Datachain), Docker-based containerization, and scalable infrastructure for distributed training.
Fluency in audio signal processing fundamentals and the integration of DSP into ML pipelines.
Proven ability to contribute to architectural planning, research strategy, and production deployment in complex, multi-stakeholder environments.
Preferred Qualifications
Familiarity with audio/text/video multi-modal frameworks and cross-domain representations.
Experience implementing real-time or near-real-time inference pipelines in cloud or edge environments (e.g., AWS, GCP, on-prem GPUs).
Working knowledge of latent diffusion audio models (e.g., stable-audio, AudioLDM, AudioGen).
Strong knowledge of industry-standard audio datasets and benchmarks (LibriSpeech, VCTK, MUSDB, etc.).
Experience optimizing inference pipelines for creative applications or interactive use.
Proficiency in lower-level audio frameworks (C / C++, etc.)
Contributions to published research at top-tier conferences (NeurIPS, ICASSP, ICLR, Interspeech) and/or open-source ML frameworks.
Job Posting Segment:
Skywalker SoundJob Posting Primary Business:
Skywalker Sound-EngineeringPrimary Job Posting Category:
Software EngineerEmployment Type:
Full timePrimary City, State, Region, Postal Code:
Nicasio, CA, USAAlternate City, State, Region, Postal Code:
Date Posted:
2025-08-19About Company
The Walt Disney Company, commonly known as Disney, is a global entertainment conglomerate headquartered in Burbank, California. Founded in 1923 by Walt Disney and Roy O. Disney, the company is known for its iconic animated films, television networks, theme parks, and entertainment properties. Disney’s portfolio includes beloved franchises like Mickey Mouse, Pixar, Marvel, Star Wars, and ESPN. The company operates in multiple segments, including media networks, parks and resorts, studio entertainment, and direct-to-consumer services like Disney+. Disney’s influence extends across film, television, streaming, merchandising, and theme park entertainment, making it one of the most recognized and successful entertainment companies in the world.
Services you might be interested in
Improve Your Resume Today
Boost your chances with professional resume services!
Get expert-reviewed, ATS-optimized resumes tailored for your experience level. Start your journey now.
