Staff Software Engineer, Machine Learning Platform

DiscordSan Francisco, CA
77d$272,000 - $306,000

About The Position

The Machine Learning Platform team at Discord is responsible for the end to end model development lifecycle across all ML applications. We work closely with machine learning engineers, core infrastructure, and ML consumers to deliver the tools, services, and frameworks that make machine learning training and deployments easy, safe, scalable, and accessible. In this role, you'll work on everything from training and batch inference pipelines, online model serving, feature stores, event stream processing, and LLM serving and evaluation. You'll lead projects and partner with ML practitioners and other Staff+ engineers to shape the future of Discord's data infrastructure. This role reports to the Senior Engineering Manager of the ML Platform team.

Requirements

  • 8+ years of experience as a software or ML engineer, working with large-scale data or distributed systems
  • 4+ years of experience building platforms and infrastructure primarily used by engineering stakeholders
  • 2+ years of experience building machine learning platforms that support the full model lifecycle, including orchestration systems (e.g. Dagster, Flyte, or Airflow)
  • Proven ability to establish and execute on clear, actionable, and accurate technical direction beyond the next month or quarter
  • Experience leading technical initiatives across teams with diverse resources, skillsets, and priorities

Nice To Haves

  • Experience operating, monitoring, and maintaining large-scale production systems, especially on Kubernetes and/or in large data domains
  • Experience with real-time data processing (e.g. Kafka, Spark, or Flink)
  • Experience building ML models using modern frameworks (e.g. PyTorch, TensorFlow, or JAX)
  • Experience deploying GenAI applications, eval frameworks, or agents
  • Experience managing infrastructure costs by designing for efficiency and scalability

Responsibilities

  • Design and build the platform that ML engineers and data scientists rely on to deliver delightful user experiences and keep communities safe
  • Evaluate and integrate new ML frameworks and tools to ensure Discord stays at the forefront of a fast-moving field, including LLMs and generative AI
  • Partner with model builders to help them ship high-quality models quickly and often
  • Champion best practices in feature engineering and ML infrastructure
  • Build and maintain shared ML foundations, including datasets and embedding models, that elevate the overall quality of ML applications

Benefits

  • Equity
  • Comprehensive benefits package

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Senior

Industry

Web Search Portals, Libraries, Archives, and Other Information Services

Number of Employees

501-1,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service