The Perception team at Zoox creates the "eyes and ears" of our self-driving robots. Navigating safely and efficiently in complex environments requires detecting, classifying, tracking, and understanding various attributes of surrounding objects—all in real-time and with exceptional accuracy. As an engineer in the Scene Understanding team, you will develop advanced Vision-Language-Action (VLA) models that perceive our vehicle's surroundings to identify hazards and make driving suggestions. You will utilize VLA models for detecting rare events and ensuring safe driving in these situations. You'll work with state-of-the-art machine learning models that operate in real-time on our robotaxi platform with minimal latency. Collaborating with world-class engineers and researchers across sensors, planning, and other teams, you'll have access to premium sensor data and cutting-edge infrastructure to validate your algorithms in real-world conditions.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior
Education Level
Ph.D. or professional degree
Number of Employees
501-1,000 employees