Site Reliability Engineer Jobs

794 jobs found — updated daily

Site Reliability Engineer ID53670

AgileEngineDowney, CA
Hybrid

About The Position

We are looking for a Middle SRE Operations Engineer to maintain reliability across a cloud-based SaaS platform. You’ll handle live incidents, improve observability, and reduce toil through automation using Kubernetes, Terraform, Grafana, and AWS. This role is hands-on, execution-focused, with real ownership across CI/CD pipelines, GitOps workflows, and on-call rotations.

Requirements

  • 2+ years of experience in Site Reliability Engineering, DevOps, or Production Operations
  • Experience with AWS supporting production environments
  • Experience supporting production SaaS applications
  • Strong understanding of CI/CD systems (GitHub Actions, Jenkins, CircleCI)
  • Experience with GitOps and Git fundamentals
  • Experience using GitHub, Jira, and Confluence
  • Experience with Kubernetes (EKS, kOps or similar)
  • Experience with Docker and containerization
  • Experience with observability tools (Grafana, Prometheus, Loki, PagerDuty)
  • Proficiency in scripting (Bash, Python, or Go)
  • Experience with Infrastructure as Code (Terraform, Helm)
  • Ability to work within structured operational processes and SLAs
  • Strong written and verbal English communication skills
  • Self-driven with a growth mindset

Nice To Haves

  • AWS certifications such as Solutions Architect, DevOps Engineer, or SysOps Administrator
  • Experience with multi-tenant SaaS environments
  • Experience working in globally distributed teams
  • Familiarity with ChatOps practices
  • Experience improving monitoring quality and reducing alert fatigue

Responsibilities

  • Monitor and support production and staging environments to ensure availability, performance, and stability
  • Respond to incidents, perform triage and root cause analysis, and contribute to remediation efforts
  • Participate in on-call rotations with defined SLAs
  • Handle operational requests from internal teams
  • Maintain and improve monitoring, alerting, dashboards, logs, and metrics
  • Support CI/CD pipelines, production releases, and GitOps workflows
  • Contribute to automation initiatives to reduce operational overhead
  • Maintain and improve Kubernetes-based infrastructure and containerized workloads
  • Support Infrastructure as Code practices and environment improvements

Benefits

  • Professional growth: Mentorship, TechTalks, and personalized growth roadmaps.
  • Competitive compensation: USD-based pay with education, fitness, and team activity budgets.
  • Exciting projects: Modern solutions with Fortune 500 and top product companies.
  • Flextime: Flexible schedule with remote and office options.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Education Level

No Education Listed

Number of Employees

251-500 employees

Career Resources

Build a Resume for Site Reliability Engineer

The resume builder that gets results.

  • Get clear feedback so you look as qualified as you are
  • Align your resume with the job to get further in the process, faster
  • Take the guesswork out of resume writing

Explore Related Job Searches

© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service