The Site Reliability Engineer will play a critical role in maintaining and scaling complex systems, ensuring the reliability, performance, and availability of infrastructure across cloud and on-premises environments. This role blends deep technical expertise in Linux systems, virtualization, container orchestration, Kubernetes, and CI/CD pipelines with proactive monitoring and operational excellence. You will collaborate closely with development and platform teams to implement best practices, automate workflows, and manage high-throughput services in large-scale datacenters. The position offers the opportunity to influence architecture, improve system resilience, and participate in incident response and root cause analysis. Ideal candidates thrive in fast-paced, distributed teams, are comfortable with both strategic planning and hands-on implementation, and are passionate about building robust and scalable systems.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level