Staff Software Engineer

Warner Bros. DiscoveryAtlanta, GA

About The Position

The AI Enablement & Machine Learning team at CNN is accelerating our digital transformation through strategic applications of machine learning and AI technologies. The ML Platform group within ML Foundations builds and maintains the infrastructure, deployment tooling, and observability that enable CNN's Machine Learning and AI Systems teams to move from prototype to production with velocity and confidence. We support diverse model architectures — two-tower, bandits, LLM- based systems, and traditional ML — across rapid experimentation and scaled production deployment. Our vision is that CNN's ML and AI teams operate with velocity and confidence, supported by infrastructure that handles everything from rapid experimentation to scaled production deployment across diverse model types and use cases. As a Staff Software Engineer on ML Platform, you will work across teams to design, build, and operate the infrastructure foundations that power model training, serving, experimentation, and observability for our Machine Learning and AI Systems teams. You will partner with ML engineers, data engineers, and AI Systems engineers to understand production needs, build reliable infrastructure, and deliver tooling that accelerates the team.

Requirements

  • 8+ years building production infrastructure or platform systems, with a Bachelor's degree in Computer Science, Information Technology, or a related technical field (or 6+ years with a Master's degree)
  • Deep expertise in distributed systems, with a track record of shipping highly available, low-latency infrastructure
  • Strong proficiency in Python and at least one of Go, Java, or C++
  • Expertise with cloud infrastructure and IaC, especially AWS and Terraform
  • Experience with ML or data infrastructure — orchestration, serving, deployment tooling, observability, or experimentation frameworks
  • Proven track record of leading complex platform projects from concept to production — knowing when to own decisions, when to rally the right people for alignment, and when to escalate
  • Collaborative mindset, understanding that great platform work depends on deep partnership with the teams you serve
  • A passion for helping CNN's engineering organization grow through mentorship, talent acquisition, and professional development

Nice To Haves

  • Experience with Metaflow, SageMaker, or comparable ML orchestration platforms
  • Experience with model registries, feature stores, or experimentation frameworks
  • Background in cost governance, FinOps, or multi-tenant infrastructure
  • Practical experience supporting LLM-based or GenAI production systems
  • Prior experience working closely with machine learning engineers

Responsibilities

  • Design and own infrastructure, deployment tooling, and developer experience for ML and AI Systems teams
  • Lead architectural decisions across orchestration, serving, observability, and experimentation infrastructure
  • Build self-service tooling that lets ML practitioners move from prototype to production without platform team dependencies
  • Establish engineering standards for ML/AI infrastructure, including reliability, cost governance, and operational excellence
  • Review designs and code, mentor engineers, and lead cross-team initiatives
  • Partner with Data Platform on infrastructure coordination and data access patterns
  • Communicate effectively across audiences — technical documentation, design reviews, and stakeholder interactions

Benefits

  • career defining opportunities
  • thoughtfully curated benefits
  • the tools to explore and grow into your best selves
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service