Nuance Labs is seeking a deeply technical Member of Technical Staff to own Reinforcement Learning (RL) and post-training for large-scale omni models. This role involves understanding modern post-training methods, building the necessary infrastructure for large-scale execution, and contributing to RL method development, rollout generation, reward modeling, policy optimization, evaluation, data feedback loops, serving, observability, and distributed execution. The successful candidate will build Nuance’s RL/post-training stack from the ground up and scale it significantly, translating research ideas into reliable training systems. The work extends beyond text to encompass audio, video, language, and real-time full-duplex interaction, focusing on improving interactive behavior, timing, interruption, emotional response, audiovisual coherence, and real-time conversational quality. This is a high-ownership role with direct impact on model improvement post-pretraining.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior
Education Level
No Education Listed