About the position
We are looking for an experienced Senior Site Reliability Engineer to join our team and contribute to the automation, performance, and reliability of our cloud-based infrastructure. As part of the compute teams at Zapier, you will play a crucial role in maintaining and scaling our container hosting platform based on Kubernetes. The ideal candidate should have a strong background in systems administration, systems engineering, or software development, with a focus on Site Reliability Engineering or DevOps. Additionally, proficiency in cloud-based infrastructure, infrastructure as code tools, and coding languages like Python or Go is required. Effective communication skills and a commitment to our company values are also essential for success in this role.
Responsibilities
Requirements
- 4 years of experience in SaaS companies in systems administration, systems engineering, or software development
- At least 2 years of experience in Site Reliability Engineering or DevOps
- Experience in designing or maintaining highly available, cloud-based infrastructure in AWS or another cloud offering
- Familiarity with infrastructure as code tools and best practices for reliability and observability
- Proficiency in coding with languages like Python or Go
- Ability to solve complex systems challenges and improve performance
- Excellent communication skills, both written and verbal
- Alignment with Zapier's values and ability to thrive in a collaborative setting
- Experience in designing and deploying AWS infrastructure using infrastructure as code tools
- Knowledge of Kubernetes clusters and serverless functions
- Ability to evaluate and recommend new tools and technologies
- Strong problem-solving skills and ability to apply SRE principles
- Experience in building and integrating systems, handling high-traffic workloads, and performing migrations
- Familiarity with GitLab, Prometheus, Redis, and Datadog
- Ability to build new features and services
- Strong focus on automation and building robust and resilient systems
- Familiarity with rapid deployment pipelines and continuous delivery practices
- Understanding of compensation principles and practices
Benefits
- Supportive team to solve contributing causes and learn from failures
- Opportunity to build robust and resilient systems for customers
- Emphasis on automation to solve problems
- Built dozens of new microservices in Kubernetes
- Contribution to open-source Kubernetes projects
- Open-sourced internal tools like Kubechecks and TFBuddy
- Handling millions of daily requests
- Experience with major migrations between hosting providers, database systems, and stateful technologies
- Creation of rapid deployment pipelines in GitLab and ArgoCD for continuous delivery
- Ability to bootstrap new Kubernetes services with ease
- Competitive compensation in the technology sector
- Equitable pay practices based on competencies
- Simple and transparent pay practices
- Clear understanding of how pay is determined
- Pay ranges for the role in Australia and New Zealand
- Finalized compensation package based on experience, competencies, and internal equity
- Competency-based approach to base pay
- Upper half of pay range reserved for high-performing individuals
- Non-standard application process designed to promote inclusion and equity
- Encouragement to apply even if skills and experiences don't exactly match the job description
- Prompt communication throughout the application process
- Equal-opportunity employer committed to diversity and inclusion
- Consideration of all qualified applicants, including those with criminal histories
- Commitment to providing reasonable accommodations for individuals with disabilities
- All-remote company with restrictions on permanent work locations