Staff Software Engineer - ML Platform

Riot GamesLos Angeles, CA
92d

About The Position

Riot Games is seeking a Staff Software Engineer to help evolve our next-generation ML Platform, enabling teams across Riot to seamlessly productionize and operate machine learning models at global scale. In this role, you’ll focus on bringing modern MLOps practices to life; streamlining deployment, simplifying ongoing operations, standardizing infrastructure, and giving our teams of data scientists, ML engineers, and product owners the tools to deliver value faster. As a Staff Software Engineer, you’ll apply MLOps concepts to architect, build, and guide key infrastructure for model deployment, monitoring, and lifecycle management. You’ll design for GPU and CPU workloads, support live testing scenarios (A/B, shadow modes), manage model versioning, and ensure the platform is robust, cost-efficient, and extensible. Your contributions will directly power high-impact ML applications across Riot’s games and internal systems, helping the company scale machine learning in a sustainable, consistent, and developer-friendly way.

Requirements

  • 6+ years of experience in software engineering, with substantial time spent in platform or infrastructure teams.
  • Proven technical leadership in building large scale distributed systems, production ML systems or model serving infrastructure at scale.
  • Deep experience with cloud-native systems (e.g., Kubernetes, containerization, autoscaling, observability stacks).
  • Experience with one or more inference serving frameworks (e.g., NVIDIA Triton, KServe, TorchServe, BentoML, Seldon Core etc).
  • Familiarity with GPU orchestration, performance tuning, and cost-aware scheduling.
  • Strong background in CI/CD automation, IaC tools (e.g., Terraform), and artifact management.
  • Hands-on experience with Python ML ecosystems, package management (Poetry, Conda etc), and vulnerability scanning.
  • Ability to mentor engineers, write clear documentation, and influence cross-functional stakeholders.

Nice To Haves

  • Experience building ML infrastructure within a real-time, or latency-sensitive environment.
  • Familiarity with ML workflow tools (MLFlow, DVC, LakeFS, etc), model lifecycle management, and drift monitoring strategies.
  • Exposure to AB testing and experimentation frameworks, especially in online model evaluation.
  • Prior success in founding or greenfield platform work, especially building toward multi-tenancy or self-service capabilities.
  • Passion for player experience, game systems, or creative technology development.
  • Familiarity/experience with technical deployments in China, particularly in Tencent.

Responsibilities

  • Design and implement ML inference infrastructure, supporting both real-time and nearline batch use cases, with CPU/GPU-aware orchestration and automated deployment pipelines for scalable model serving.
  • Partner with researchers, game teams, and platform engineers to understand product needs and deliver generalizable, reusable solutions.
  • Define and build CI/CD workflows for ML artifacts, supporting rapid iteration and safe promotion from dev to production and MLOps practices.
  • Develop tooling for environment and dependency management (e.g., Conda/Poetry lock files, secure image builds) to ensure reliable, reproducible ML runtimes.
  • Implement platform observability features such as monitoring, drift detection, resource utilization, and latency tracking.
  • Establish patterns and tooling for multi-version model support, blue/green and shadow deployments, and rollback.
  • Be thoughtful on developer UX and incorporate an iterative approach to improving.
  • Provide technical leadership by shaping platform architecture, mentoring engineers, and contributing to team growth.
  • Contribute upstream to shared infra initiatives and build a feedback loops and collaboration models with other Riot platform teams.

Benefits

  • Open paid time off policy.
  • Flexible work schedules.
  • Medical, dental, and life insurance.
  • Parental leave for you, your spouse/domestic partner, and children.
  • 401k with company match.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service