Senior Software Engineer, Machine Learning Platform

The New York Times
New York, NY
Hybrid

About The Position

The New York Times is looking for a Senior Software Engineer, with a focus on ML platform development, to join our Machine Learning & AI team and help solve creative challenges around Machine Learning infrastructure. Machine Learning (ML) at The New York Times enhances the experience of our 150 million digital readers around the globe and grows our subscriber base through content recommendations and personalization.

The Machine Learning & AI team builds and maintains the infrastructure, both data and compute, that hosts all of The New York Times's real-time ML inference models. On one side, our partners are Data Scientists who build and deploy their ML models on the ML platform. On the other side, our partners are the engineering systems that call these hosted models at scale, with low latency and Service Level Agreements (SLAs) guaranteed by our platform.

This is a hybrid role based in New York City. You will report to the Engineering Manager of the Machine Learning Platform team.

Requirements

  • 5+ years of directly relevant experience in MLOps or DevOps, including experience operating large systems in a production environment
  • Experience deploying and monitoring systems using cloud infrastructure (GCP or AWS)
  • Experience working with Kubernetes, Docker, and CI/CD (Drone, Argo, Jenkins, etc.)
  • Experience leading the development of large-scale, data-driven, distributed multi-tenant systems
  • Proficiency with at least one high-level programming language like Python or Go

Nice To Haves

  • Experience with any of these technologies: Terraform, Airflow, SQL/BigQuery, BigTable, or other NoSQL datastores such as Cassandra, DynamoDB, and Redis
  • Familiarity with ML tooling such as Triton, TensorFlow, scikit-learn
  • Experience building the infrastructure that powers real-world machine learning applications like recommendation systems, bandits, etc.
  • Experience engaging with partners to understand pain points, observe patterns, and identify opportunities for improvements

Responsibilities

  • You will research, develop, and deploy infrastructure for the Machine Learning Platform that supports large-scale multi-tenant workloads
  • You will build a platform to train and test algorithms that provide real-time content recommendations and personalization to our readers
  • You will enhance the ML platform's CI/CD and integration testing capabilities
  • You will build a platform that supports the full Machine Learning workflow, from model training and testing to production deployment at high scale and low latency
  • You will demonstrate support for and understanding of our value of journalistic independence and a strong commitment to our mission to seek the truth and help people understand the world

Benefits

  • Depending on your role, you may be eligible for variable pay, such as an annual bonus and restricted stock.
  • Benefits may include medical, dental and vision benefits, Flexible Spending Accounts (F.S.A.s), a company-matching 401(k) plan, paid vacation, paid sick days, paid parental leave, tuition reimbursement and professional development programs.