Sr. Site Reliability Engineer

IllumioSunnyvale, CA
Onsite

About The Position

Illumio is seeking an experienced Senior Site Reliability Engineer (SRE) with a strong background in AWS & Azure cloud platforms. This role is crucial for ensuring the reliability, scalability, and performance of our cloud-based systems and applications. The ideal candidate will have hands-on experience supporting and managing AWS and Azure infrastructure, coupled with a passion for automation, continuous improvement, and collaboration. Illumio is a leader in ransomware and breach containment, redefining how organizations contain cyberattacks and enable operational resilience. Our breach containment platform uses the Illumio AI Security Graph to identify and contain threats across hybrid multi-cloud environments, stopping attacks before they become disasters. We are recognized as a Leader in the Forrester Wave™ for Microsegmentation, enabling Zero Trust and strengthening cyber resilience. Our Engineering team is shaping the future of cybersecurity, fostering a culture of innovation, autonomy, and ownership. We are redefining security for a world facing unprecedented cyber threats and work with a highly scalable SaaS service built using cloud-native technologies while simultaneously shipping the solution on-premises. Our guiding philosophy in Engineering is to get things right through disciplined engineering, focusing, not cutting corners, and having fun. We believe in enabling ownership at all levels and empowering teams.

Requirements

  • Bachelor’s degree in computer science, Engineering, or related field; or equivalent work experience
  • 5+ years of experience working as a Site Reliability Engineer (SRE) or similar role, with a focus on AWS and/or Azure cloud platform
  • Hands-on experience in designing, deploying, and managing AWS and/or Azure infrastructure, including compute, storage, networking, and security services
  • Proficiency in scripting and programming languages such as PowerShell, Python, or Go for automation and infrastructure management tasks
  • Strong understanding of CI/CD principles and experience with tools such as Azure DevOps, Jenkins, or GitLab CI/CD
  • Excellent analytical, problem-solving, and communication skills, with the ability to collaborate effectively with cross-functional teams

Nice To Haves

  • Experience with containerization technologies (e.g., Docker, Kubernetes) and microservices architecture in AWS and Azure environments is a plus
  • AWS or Azure certifications such as AWS/Azure Solutions Architect, Azure DevOps Engineer, or Azure Security Engineer are preferred

Responsibilities

  • Monitor system performance, application health, and infrastructure metrics using monitoring and logging services, and implement proactive measures to optimize performance and availability
  • Oncall duty for production uptime and support for customer escalations
  • Release upgrades and maintenance activities including hotfixes and infrastructure updates
  • Lead incident response and resolution efforts, conducting root cause analysis, implementing corrective actions, and documenting post-incident reviews
  • Implement security best practices and controls in the cloud environments to protect data, applications, and infrastructure, and ensure compliance with regulatory requirements
  • Drive continuous improvement initiatives to enhance reliability, scalability, and efficiency of infrastructure and services, leveraging automation and emerging technologies
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service