What you can expect We are looking for a Research Scientist with a solid background in speech recognition, speech synthesis, and speech processing. You will build advanced speech understanding models on large-scale datasets, transforming speech into human- and LLM-readable text to fulfill Zoom’s vision of seamless conversation-to-task completion. This role will also have you collaborating with cross-functional teams, including product, science and engineering teams, to deliver high-impact projects from the ground up. About the Team Zoom's AI Speech Team is developing speech technologies to improve Zoom's conversational AI experience. Our work contributes to Zoom AI Companion, Zoom Meetings, Zoom Contact Center, Zoom Phone, and Zoom Revenue Accelerator. You will develop novel solutions in automatic speech recognition (ASR), text-to-speech (TTS), voice agents, speech-to-speech translation, and speech-focused large language models (LLMs) to transform conversations into actionable tasks for users worldwide.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Number of Employees
1,001-5,000 employees