Video AI Engineer

ZoomSan Jose, CA
18hHybrid

About The Position

As a Video AI Engineer, you’ll enhance video codecs, video generation, and real-time 3D reconstruction to improve video quality, immersion, and performance in Zoom products. You will work across our stack, developing software ranging from Web Server to business application layers for our distributed, cloud-hosted backend. Working alongside leading experts in the field, you’ll deliver happiness to our users and grow your knowledge base every day. About the Team With eight specialized departments, the engineering team functions as a highly collaborative, diverse powerhouse. Each department mission is to deliver seamless and innovative communication solutions. These range from software development and machine learning to quality assurance teams that work to create and maintain Zoom's user-friendly interfaces and robust infrastructure. The team continues to push the boundaries of communication technology, bringing people together regardless of their physical distance.

Requirements

  • Hold either a PhD or Master in Electrical Engineering, Computer Science, Applied Mathematics, or related fields
  • Have experience with C/C++ or Objective-C, and Python, talking avatar/head/portrait(with released projects and top conference papers)
  • Have hands-on experience with video generation or video diffusion models, neural rendering techniques (e.g., NeRF, 3D Gaussian Splatting), and 3D reconstruction systems.
  • Have hands-on experience with machine learning techniques such as generative models, diffusion models, discriminative models, or transfer learning.
  • Possess experience with or a solid understanding of at least one deepfake detection approach. This includes biometric analysis–based methods, vision-language model (VLM)–based techniques, interactive behavior analysis, or multimodal signal modeling that leverages visual, temporal, and audio cues.
  • Have familiarity with multi-threaded programming and communication mechanisms
  • Have understanding of multimedia stream data processing flows, ideally including 3D scene or point cloud pipelines
  • Must be fluent in Mandarin

Responsibilities

  • Building and developing video and generative video processing applications on both desktop and mobile systems
  • Participating in research and performance evaluation of video processing, video generation, and 3D reconstruction algorithms
  • Designing and developing algorithms in Zoom’s video and 3D reconstruction processing pipelines at both module and system levels
  • Implementing video, neural rendering, and 3D Gaussian Splatting algorithms with modular, well-organized, and production-ready code
  • Optimizing video, generative, and 3D reconstruction algorithms to achieve real-time performance on corresponding platforms
  • Customizing, integrating, and shipping deep learning models—including video generative models and 3D neural rendering models—across Mac, Windows, iOS, and Android
  • Setting up test environments, developing test tools, and designing unit tests for runtime verification of video and 3D pipeline components
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service