Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver—The World's Most Experienced Driver™—to improve access to mobility while saving thousands of lives now lost to traffic crashes. The Waymo Driver powers Waymo’s fully autonomous ride-hail service and can also be applied to a range of vehicle platforms and product use cases. The Waymo Driver has provided over ten million rider-only trips, enabled by its experience autonomously driving over 100 million miles on public roads and tens of billions in simulation across 15+ U.S. states. The Large Model Evaluation team is at the nexus of Waymo’s AI ambition. With advancements in Large Language Models (LLMs) and Vision-Language Models (VLMs), Waymo is building state-of-the-art AI systems that handle the full complexity of real-world driving. At its core, our progress is defined by our ability to measure it. While robust evaluation is the bottleneck for deploying any large model, the challenge at Waymo is uniquely complex and safety-critical. We are looking for quantitatively-minded engineers to research and propose new ways to assess the ML models deployed in the Waymo Driver.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Education Level
No Education Listed
Number of Employees
1,001-5,000 employees