At Netflix, our mission is to entertain the world. Together, we are writing the next episode - pushing the boundaries of storytelling, global fandom and making the unimaginable a reality. We are a dream team obsessed with the uncomfortable excitement of discovering what happens when you merge creativity, intuition and cutting-edge technology. Come be a part of what’s next. AI and ML powers innovation in all areas of the business, including helping members choose the right title for them through personalization, better understanding our audience and our content slate, creating high-quality subtitles, dubbings, images, trailers, and other assets, optimizing our payment processing, and much more. The Artificial Intelligence Platform (AIP) organization builds highly scalable, differentiated AI infrastructure to maximize the business impact of all AI/ML practitioners at Netflix, which is key to accelerating this innovation. The Opportunity The AI Observability team makes AI, ML, and Agentic systems transparent, reliable, and production-ready at scale. We build end-to-end observability for ML and GenAI workloads, capturing model inputs, features, predictions, outcomes, and behavior across online and batch systems. Our platform enables teams to monitor model performance, data quality, drift, latency, and failures, turning the ML system from a black box into an explainable, debuggable system. We provide developer-friendly libraries, dashboards, and alerts so teams can debug issues, respond to incidents, and ship AI-powered products with confidence. We are looking for an experienced AI/ML infrastructure engineering leader to build and lead the next generation of our AI observability platform. You will lead this newly formed team to architect, design, develop, test, and launch a brand-new platform to enable ML practitioners across different business domains to effortlessly collect model inputs, features, and predictions for thousands of large-scale models, including Large Language Models (LLMs), computer vision, and foundation models. We are a highly collaborative team. You will be highly cross-functional in partnering with other engineering, product management, machine learning, and data teams to take Netflix’s AI/ML initiatives to the next level. To succeed in this role, you will need a strong background in AI infrastructure and a passion for building scalable, robust systems that enable and accelerate the application of AI Observability to large, complex ML models across diverse domains.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Manager
Education Level
No Education Listed
Number of Employees
5,001-10,000 employees