Staff Software Engineer, Machine Learning

Walmart
92d$145,600 - $270,400

About The Position

We’re looking for a Staff Software Engineer for our MLOps Platform to join the AI/ML organization, focusing on supporting applications of AI to video. In this role, you will help build a robust, scalable platform that enables the training, deployment, and observability of machine learning models for video understanding and classifications. You will drive platform architecture, lead critical components, and work across teams to establish engineering excellence in the ML infrastructure layer.

Requirements

  • 7+ years of software engineering experience building scalable backend systems.
  • Deep experience with cloud platforms (AWS, GCP); ability to design for cloud-agnostic deployment.
  • Experience working with production deployment of machine learning systems.
  • Proficiency in Python and Go, with strong software design and architecture skills.
  • Proven experience implementing and managing CI/CD workflows and IaC with tools like Terraform or CloudFormation.
  • Familiarity with Agile development practices and a strong operational mindset.
  • Experience in building and maintaining observability stacks (e.g., Prometheus, Grafana, OpenTelemetry).
  • Strong understanding of media processing and video engineering concepts including transcoding, segmentation, and frame manipulation, using libraries such as FFMPEG, OpenCV, and Demux.
  • Experience collaborating in a cross-functional team and setting engineering standards at scale.

Responsibilities

  • Design and lead the development of core components of the machine learning platform with a focus on scalability, reliability, and cloud-agnostic principles.
  • Build APIs and microservices that power end-to-end machine learning workflows, including model deployment, orchestration, and observability.
  • Collaborate with content protection and security teams to ensure robust access controls and compliance.
  • Own cloud infrastructure and CI/CD automation using Infrastructure-as-Code (IaC) principles.
  • Champion engineering excellence through well-tested code, clear documentation, and continuous improvement of development and deployment practices.
  • Establish and enforce best practices in testing, code quality, and monitoring for ML pipeline components.
  • Work closely with Video AI engineers and product managers to support evolving use cases like scene segmentation, video annotation, clip generation, and metadata enrichment.

Benefits

  • Health insurance coverage
  • Employee wellness program
  • Life and disability insurance
  • Retirement savings plan
  • Paid holidays and sick time
  • Vacation
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service