Site Reliability Engineer II

IllumioSunnyvale, CA
6h

About The Position

Onwards Together! Illumio is the leader in ransomware and breach containment, redefining how organizations contain cyberattacks and enable operational resilience. Powered by the Illumio AI Security Graph, our breach containment platform identifies and contains threats across hybrid multi-cloud environments – stopping the spread of attacks before they become disasters. Recognized as a Leader in the Forrester Wave™ for Microsegmentation, Illumio enables Zero Trust, strengthening cyber resilience for the infrastructure, systems, and organizations that keep the world running. Our Team's Vision: Our Engineering team is driven by a culture that thrives on visionary leadership, autonomy, and ownership, creating a dynamic synergy that drives us forward in the ever-evolving landscape of cybersecurity. When you join our team, you become part of the leader in Zero Trust Segmentation. You'll work with a cutting-edge technology stack that spans operating systems, distributed applications, and immersive UI/visualization tools. We're shaping the future of cybersecurity. And together, we will continue to build world-class products—led by people with different perspectives, backgrounds, and a commitment to innovation in a time when the world faces its greatest cybersecurity threats in history. Your Impact: As an SRE Engineer II, you will be responsible for managing our multi-cloud infrastructure on Azure, AWS and/or GCP. As and when required, you will be responsible for designing new services and applications in the cloud(s) and take them from development to production while working closely with Engineering, SRE/OPS, and Security teams. On a day-to-day basis, you will work on enhancing system reliability and scalability of Illumio SaaS products, and drive continuous improvement initiatives. The ideal candidate will have a passion for cloud technology, automation, and collaboration, along with a solid foundation in Azure cloud platform and related DevOps practices.

Requirements

  • Bachelor's degree in Computer Science, Engineering, or related field; or equivalent work experience
  • 2+ years of experience working as an SRE, DevOps Engineer, or similar role, with hands-on experience in Azure cloud platform in a production environment setting
  • Proficiency in scripting and programming languages such as PowerShell, Python, or Go for automation and infrastructure management tasks
  • Experience with CI/CD tools and methodologies, containerization technologies, and microservices architecture in cloud environments
  • Strong analytical, problem-solving, and communication skills, with the ability to collaborate effectively with cross-functional teams

Nice To Haves

  • Exposure to AWS and/or GCP cloud platforms is preferred
  • Azure certifications such as Azure Administrator, Azure Developer, or AWS/GCP certifications are a plus

Responsibilities

  • Design, deploy, and maintain cloud infrastructure solutions on Azure, AWS, and/or GCP to support our applications and services
  • Implement infrastructure as code (IaC) principles using tools such as Terraform, ARM templates, or CloudFormation to automate provisioning and configuration management
  • Develop and maintain CI/CD pipelines for automated software delivery and deployment, leveraging tools such as Azure DevOps, AWS CodePipeline, or Jenkins
  • Monitor system performance, application health, and infrastructure metrics using cloud monitoring and logging services, and implement proactive measures to optimize performance and availability
  • Support incident response and resolution efforts, conduct root cause analysis, implement corrective actions, and document post-incident reviews
  • Collaborate with Engineering teams to design and implement scalable and reliable architectures, providing guidance on best practices for cloud-native application development
  • Implement security best practices and controls in cloud environments to protect data, applications, and infrastructure, and ensure compliance with regulatory requirements
  • Drive automation initiatives to streamline operational tasks, reduce manual effort, and improve overall efficiency in cloud operations
  • Stay current with cloud platform updates, trends, and best practices, and evaluate emerging technologies for potential adoption to drive innovation and efficiency
  • Provide support and guidance to junior team members, fostering a culture of learning, collaboration, and continuous improvement within the SRE/DevOps team
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service