Overview

We are seeking a Senior Site Reliability Engineer (SRE) to help develop our platform operations across Windows, Linux, and cloud-native environments. This role is central to our transformation from app-specific support to platform-wide reliability engineering. You will bring deep expertise in Google Cloud Platform (GCP), container orchestration, and automation, enabling scalable, secure, and resilient infrastructure that supports diverse applications across our enterprise.

Key Responsibilities

Platform Reliability & Cloud Engineering
Ensure high availability, performance, and security of production systems across Windows, Linux, and GCP environments.
Engineer and support containerized workloads using Kubernetes (GKE) and Docker, enabling scalable microservices architectures.
Lead infrastructure provisioning and configuration using Terraform, Ansible, and GCP-native tools.

Automation & Observability
Develop automation scripts and pipelines to eliminate manual toil and accelerate incident response.
Implement observability frameworks using SLIs/SLOs, Prometheus, Grafana, and GCP Operations Suite.
Drive proactive monitoring, alerting, and telemetry across hybrid environments.

Incident Management & Resilience
Lead incident response, root cause analysis, and postmortems.
Build self-healing systems and automated remediation workflows using GCP-native services and scripting.

Security & Compliance
Collaborate with InfoSec to enforce hardening standards, manage vulnerabilities, and support compliance initiatives.
Integrate security into CI/CD pipelines and container platforms using IAM, encryption, and policy enforcement.

Collaboration & Enablement
Partner with developers, application owners, and infrastructure teams to deliver reliable, cloud-native platforms.
Document configurations, runbooks, and operational procedures to enable cross-team reuse and transparency.
Job Type: Full-time
Career Level: Mid Level
Education Level: No Education Listed
Number of Employees: 5,001-10,000 employees