As a Site Reliability Engineer (SRE) , you will help design, build, and operate reliable, secure, and observable cloud-native systems that support mission-critical applications and services. You will blend software engineering, DevOps practices, and infrastructure expertise to improve system reliability, performance, and operational excellence across the platform. Contributions Responsibilities Establishing development tools and infrastructure for automation. Understanding the needs of stakeholders and conveying this to developers. Automate and improve development, testing, deployment, and release processes. Testing and examining code written by others and analyzing results. Own and improve the reliability, availability, and performance of production systems and services. Define, implement, and maintain Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets. Perform capacity planning, scalability analysis, and performance tuning for applications and infrastructure. Participate in on-call rotations, incident response, and post-incident reviews to drive long-term improvements. Design and implement infrastructure-as-code (IaC) to provision and manage cloud resources (e.g., AWS, Azure, GCP). Build and maintain CI/CD pipelines to ensure reliable, repeatable delivery of application and infrastructure changes. Engineer resilient architectures using concepts such as auto-scaling, blue/green deployments, canary releases, and self-healing patterns. Collaborate with security and platform teams to ensure infrastructure adheres to compliance, security, and governance requirements. Collaborate with application development teams to design reliable, observable, and operable services from the outset. Contribute to application code, tooling, and frameworks that enhance reliability, resilience, and performance. Act as an individual contributor and mentor more junior team members. Present regular status updates and provide cross-training to other DevOps team members.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Number of Employees
251-500 employees