Site Reliability Engineer Jobs

794 jobs found — updated daily

Site Reliability Engineer ID53670

AgileEngine•Downey, CA

5d•Hybrid

About The Position

We are looking for a Middle SRE Operations Engineer to maintain reliability across a cloud-based SaaS platform. You’ll handle live incidents, improve observability, and reduce toil through automation using Kubernetes, Terraform, Grafana, and AWS. This role is hands-on, execution-focused, with real ownership across CI/CD pipelines, GitOps workflows, and on-call rotations.

Requirements

2+ years of experience in Site Reliability Engineering, DevOps, or Production Operations
Experience with AWS supporting production environments
Experience supporting production SaaS applications
Strong understanding of CI/CD systems (GitHub Actions, Jenkins, CircleCI)
Experience with GitOps and Git fundamentals
Experience using GitHub, Jira, and Confluence
Experience with Kubernetes (EKS, kOps or similar)
Experience with Docker and containerization
Experience with observability tools (Grafana, Prometheus, Loki, PagerDuty)
Proficiency in scripting (Bash, Python, or Go)
Experience with Infrastructure as Code (Terraform, Helm)
Ability to work within structured operational processes and SLAs
Strong written and verbal English communication skills
Self-driven with a growth mindset

Nice To Haves

AWS certifications such as Solutions Architect, DevOps Engineer, or SysOps Administrator
Experience with multi-tenant SaaS environments
Experience working in globally distributed teams
Familiarity with ChatOps practices
Experience improving monitoring quality and reducing alert fatigue

Responsibilities

Monitor and support production and staging environments to ensure availability, performance, and stability
Respond to incidents, perform triage and root cause analysis, and contribute to remediation efforts
Participate in on-call rotations with defined SLAs
Handle operational requests from internal teams
Maintain and improve monitoring, alerting, dashboards, logs, and metrics
Support CI/CD pipelines, production releases, and GitOps workflows
Contribute to automation initiatives to reduce operational overhead
Maintain and improve Kubernetes-based infrastructure and containerized workloads
Support Infrastructure as Code practices and environment improvements

Benefits

Professional growth: Mentorship, TechTalks, and personalized growth roadmaps.
Competitive compensation: USD-based pay with education, fitness, and team activity budgets.
Exciting projects: Modern solutions with Fortune 500 and top product companies.
Flextime: Flexible schedule with remote and office options.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Education Level

No Education Listed

Number of Employees

251-500 employees

Job Search Resources

Career Resources

Site Reliability Engineer Career GuideSkills, salary, and career path

Site Reliability Engineer Resume ExamplesSamples and writing tips

Site Reliability Engineer Cover Letter ExamplesTemplates and best practices

Build a Resume for Site Reliability Engineer

The resume builder that gets results.

Get clear feedback so you look as qualified as you are
Align your resume with the job to get further in the process, faster
Take the guesswork out of resume writing

Explore the resume builder

Explore Related Job Searches

Engineer Manager Director Analyst Software Engineer Rn

Over 4 Million Users

Free AI tools and resources to help you land your next job, faster

Site Reliability Engineer Jobs