Netflix is one of the world's leading entertainment services, with over 300 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages. Members can play, pause and resume watching as much as they want, anytime, anywhere, and can change their plans at any time. The Opportunity The Model Observability & Lifecycle Management team’s centralized MLOps platform multiplies the productivity of both the Machine Learning Platform (MLP) organization and all ML researchers across Netflix. We maintain the reliability of ML applications by building systems to catch and diagnose issues as soon as possible, sometimes before they even happen! The Build and Release (B&R) group within OLM is building a novel monorepo environment to provide build, testing, CI/CD, and developer tooling for our ML engineers and researchers. Engineers enjoy standardization in build toolchains, dependency management, runtime version migrations, and documentation tooling, allowing them to focus on their work, not wrangling their tools. We are seeking strong build engineers with experience with B&R best practices at scale. Our work supports B&R operations for our most critical applications, such as real-time inference services, feature computation and serving, ML model representations, and much more. You’ll play a foundational role in establishing the practices used by hundreds of engineers and ML researchers across dozens of use cases. Our team is small and growing, so you’ll mentor incoming B&R engineers as they grow into their roles supporting our organization. To be successful in this role, you must have a strong software engineering background, a keen sense of software design, and experience operating large CI/CD systems. Snapshot of projects you may work on: Help make foundational technology decisions with an eye toward large-scale repository management. Define best practices for large-scale monorepos, influencing how hundreds of engineers work daily. Anticipate and prepare for scaling opportunities as our repository grows beyond the capabilities of existing tools, such as build time reductions, flaky test handling, build tool migrations, and more. Onboard existing repositories as they enter the monorepo, including harmonizing their homegrown toolchains with our standardized offers. Expand B&R support to accommodate new languages. We currently support Java and Scala, and are expanding to Python. Support teams with company-wide migrations and version upgrades, such as build and runtime environment versions and library versions. Work alongside company-wide build experts to incorporate existing company-wide tools into our environment. Create measurement harnesses to measure build performance, mean time to failure, and other critical developer velocity metrics.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Education Level
No Education Listed
Number of Employees
5,001-10,000 employees