Zoom-posted 2 months ago
$137,700 - $275,400/Yr
Full-time • Mid Level
Hybrid • Seattle, WA
Professional, Scientific, and Technical Services

We are looking for a Research Scientist, with a solid background in speech recognition and speech processing. On this team you will develop state-of-the-art automatic speech recognition models on large-scale datasets for Zoom products. This role will also have you collaborating with cross-functional teams, including products, science engineering teams, to deliver high-impact projects from the ground up. The Zoom AI Speech Team is developing speech recognition technologies to improve Zoom's conversational AI experience. This includes Zoom AI Companion, Zoom Meetings, Zoom Contact Center, Zoom Phone, Zoom Revenue Accelerator. As a Research Scientist, you will develop novel automatic speech recognition (ASR) solutions to deliver a unique AI-powered collaboration platform to users across the globe.

  • Developing state-of-the-art ASR models on large-scale datasets for Zoom products.
  • Devising novel techniques where off-the-shelf solutions are not available.
  • Demonstrating technical judgment in the entire ASR development cycles, including data collection, model prototyping, training, optimization and evaluation.
  • Collaborating with cross-functional teams, including products, science engineering teams, to deliver high-impact projects from the ground up.
  • Mentoring and provide technical guidance to junior ASR team members.
  • Possess a Master's in Computer Science, Electrical Engineering or related fields with 5 years of experience.
  • Display knowledge in deep learning and hands-on programming skills in Python, shell scripts; familiarity with ML frameworks such as PyTorch and TensorFlow.
  • Demonstrate experience in speech recognition, speech processing, natural language processing or related fields in academic research or industry settings.
  • Have domain expertise in speech recognition topics: modern end-to-end ASR architectures, language modeling, decoding algorithms, on-device ASR models, personalization and adaptation, semi-/self-supervised learning, multilingual and robust ASR, LLM-integrative ASR.
  • Experience with speech recognition toolkits and libraries such as Kaldi/k2, ESPNet, NeMo, or TorchAudio.
  • Experience with large scale data processing and model training.
  • Demonstrate collaboration and communication skills.
  • A variety of perks, benefits, and options to help employees maintain their physical, mental, emotional, and financial health.
  • Support work-life balance.
  • Contribute to their community in meaningful ways.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service