About The Position

Netflix is one of the world's leading entertainment services, with over 300 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages. Members can play, pause and resume watching as much as they want, anytime, anywhere, and can change their plans at any time. As a Machine Learning Engineer (MLE), you will join a data org of MLEs, Analytics Engineers, and Data Scientists who partner with our infrastructure engineering teams to improve the performance, reliability and efficiency of our infrastructure systems. You will work closely with machine learning and software engineers to forecast traffic and demand on our systems, and to build models that optimize our capacity and traffic steering decisions. The ideal candidate will excel in the end to end lifecycle of designing, developing and maintaining machine learning models. They will also have experience in solving infrastructure problems on large, distributed cloud systems (e.g. AWS) such as traffic forecasting, capacity planning, traffic management.

Requirements

  • Experienced in developing and implementing machine learning models with a successful track record of driving business impact
  • Deeply familiar with the ML lifecycle and strong technical judgment when assessing different solutions for deploying models in production
  • Experienced in and motivated by the infrastructure domain, having worked on large distributed infra systems on topics such as demand forecasting, capacity planning, traffic steering, load balancing, etc.
  • An exceptional thought partner with strong communication skills, able to explain complex technical concepts clearly to cross-functional partners and business leaders
  • Comfortable with ambiguity, with a strong ownership mindset, and thrive with minimal oversight and process
  • A strong coder with experience in Python and standard ML frameworks like PyTorch and TensorFlow
  • Experience with languages like Java or C++ is a plus
  • Familiarity with optimization models with standard frameworks/solvers (e.g., XPress, cvxpy, Gurobi) is a plus
  • Familiarity with operational tooling for ML services (monitoring, alerting, etc.), and services for model hosting/serving

Responsibilities

  • Forecast and predict key business and technical inputs such as traffic volume and resource demand across our fleet of cloud services and systems
  • Build, update and maintain machine learning models to optimize our infrastructure footprint on topics such as capacity planning, autoscaling, loadshedding, and traffic steering
  • Own the end-to-end model lifecycle including ideation, feature building, training, evaluation, monitoring and continuous improvement
  • Partner with software engineers to identify high value opportunities to apply modeling techniques to improve the performance of infrastructure management systems
  • Partner with ML engineers and data scientists on model observability initiatives and experimentation to improve model design
  • Live Netflix values while bringing a new perspective to continue improving our culture

Benefits

  • Health Plans
  • Mental Health support
  • 401(k) Retirement Plan with employer match
  • Stock Option Program
  • Disability Programs
  • Health Savings and Flexible Spending Accounts
  • Family-forming benefits
  • Life and Serious Injury Benefits
  • Paid leave of absence programs
  • Full-time hourly employees accrue 35 days annually for paid time off to be used for vacation, holidays, and sick paid time off
  • Full-time salaried employees are immediately entitled to flexible time off
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service