Senior Site Reliability Engineer, TS Clearance

Anduril Industries
Columbia, CA
Hybrid

About The Position

We are looking for a Site Reliability Engineer (SRE) to join AGD, our rapidly growing team in Costa Mesa, CA or Washington, DC. SREs work with external stakeholders to determine the technical direction of cloud deployments and deliver quickly through analysis, design and code. They are comfortable leading large, focused projects, and they lead the development of Kubernetes cloud infrastructure, DevOps and CI/CD while improving the developer experience. You will manage cloud deployments in AWS, Azure and on premises. This role emphasizes continuous innovation in our cloud computing environments.

Requirements

  • Active U.S. TOP SECRET security clearance
  • 6+ years of engineering experience
  • Technical expertise and demonstrated performance in one or more of the following areas: networking, cloud technologies, application development and/or cybersecurity
  • Deep knowledge of the Kubernetes ecosystem (Docker, Helm, ArgoCD, Terraform)
  • Experience with cloud services (AWS/Azure)
  • Experience with programming languages such as Go, Python, Rust, or C++
  • Experience performing data-driven root cause analysis on complex systems
  • Demonstrated ability to train peers or customers on the operation of a product
  • Computer Science degree or equivalent

Nice To Haves

  • Experience managing Kubernetes clusters of hundreds of nodes
  • Knowledge of performance improvement techniques, metrics and alerting
  • Experience with KubeVirt, qemu, virtualization and hypervisor technologies
  • Experience with low-level frameworks, Linux and databases
  • Excellent written and verbal communication skills

Responsibilities

  • Architect, deploy and maintain infrastructure with cloud providers and Kubernetes (EKS)
  • Collaborate with multidisciplinary teams to define and execute internal and external deployments
  • Promote SRE best practices in system resilience, performance monitoring and high availability
  • Design, develop, and deliver solutions using infrastructure as code with tools like Terraform and Python
  • Develop and maintain CI/CD pipelines for automated deployment
  • Build strong relationships with internal and external customers to identify technical solutions to their problems
  • Strengthen Anduril’s operational capabilities by improving our core product offering through root cause analysis and by creating tooling capable of managing large-scale deployments
  • Lead the organization in building scalable, sustainable mechanisms to continue delivering to customers at the pace the business is scaling

Benefits

  • Highly competitive equity grants are included in the majority of full-time offers and are considered part of Anduril's total compensation package.
  • Top-tier benefits for full-time employees