Senior Software Engineer, MLOps

Rivian, Palo Alto, CA

About The Position

The Autonomy org at Rivian is seeking a Staff Software Engineer, Data Ops to join the Data team, someone who can provide expertise in cloud and data engineering and collaborate with technical and business users. The candidate needs a strong understanding of the AWS Cloud Data Platform and of the Data Ops processes that help build, test, and release complex, mission-critical infrastructure services for Rivian's ADAS team on AWS. In this role you will work with the ADAS Cloud, Data, Perception, SIL/HIL, Vehicle Integrations, and Vehicle Cloud teams, Product Management, and other technology partners to apply best practices and reference architectures for the AWS Cloud Platform and Data/Dev/ML Ops.

Requirements

  • 5+ years in software engineering or in an ML/Dev/Data Ops role.
  • 5+ years of experience authoring, scaling, and managing production infrastructure.
  • 5+ years of experience working with Kubernetes, AWS CI/CD tools, the AWS networking stack, S3, Lambda, EKS, ECS, RDS, Systems Manager, Secrets Manager, CloudTrail, etc.
  • 5+ years of experience with Infrastructure as Code and configuration management (Terraform, AWS CloudFormation, AWS CDK).
  • 5+ years of experience monitoring cloud applications using Datadog, AWS CloudWatch, or Prometheus.
  • 5+ years of experience debugging production systems and performing root cause analysis (RCA) on incidents.
  • 3+ years of hands-on experience with Python, Go, or Java, and with GitLab for automation.
  • 2+ years of experience with CI/CD and/or GitOps patterns (using GitLab, Jenkins, Allure, etc.).
  • 2+ years of experience with microservice-oriented architectures (using Kubernetes (EKS), AWS ECS, or Docker Swarm).
  • 2+ years of knowledge of Agile Development of Accessible Software Tools.

Nice To Haves

  • Linux internals, networking, and distributed computing are a plus.
  • AWS or Cloud Native certification is a plus.

Responsibilities

  • Lead, build, test and release complex mission-critical infrastructure services for Rivian's ADAS team on cloud and/or on-prem.
  • Set up fault-tolerant, multi-region environments for data operations and data applications.
  • Own the CI/CD pipelines for apps and data projects.
  • Define on-call strategy and participate in on-call rotations.
  • Make developers' lives smooth via automated workflows.
  • Build and optimize highly reliable, scalable, and distributed infra using microservice architecture.
  • Collaborate with the security & privacy team to perform audits and mitigate any findings.
  • Collaborate with cross-functional ADAS teams for development and integrations.
  • Optimize AWS costs across multiple accounts and services.

Benefits

  • paid vacation
  • paid sick leave
  • life insurance
  • medical insurance
  • dental insurance
  • vision insurance
  • short-term disability insurance
  • long-term disability insurance
  • 401(k) Plan
  • Employee Stock Purchase Program