Sr. Site Reliability Engineer (SRE)

Avenue CodeMountain View, CA
78d$160,000 - $166,000

About The Position

We're seeking an experienced, highly collaborative SRE to partner with product teams and tackle our most critical infrastructure challenges. You'll be hands-on in designing, building, and operating our cloud platform-and driving the reliability, performance, and security that empower our engineering organization.

Requirements

  • Have 5+ years of experience running production critical systems.
  • Deep proficiency with the AWS Cloud and Cloud-Native best practices.
  • Experience with Kubernetes (EKS, GKE) and Container Orchestration at scale.
  • Skilled in Terraform to declaratively provision and maintain infrastructure services.
  • Working knowledge of managing and debugging databases like Redis and Postgres.
  • Strong familiarity with VPC, VPN, Load Balancing, and cloud networking components.
  • Proficiency with Git workflows, branching strategies, and CI/CD system integrations.
  • Solid understanding of web and network protocols and standards (HTTP, REST, TLS, DNS, etc...).
  • Professional proficiency in English (both written and spoken) is required for this role.

Nice To Haves

  • Bachelor's degree, or equivalent in Computer Science, Engineering, or a related field.
  • Experience with ArgoCD, Github Actions, Jenkins, or other CI/CD pipeline solutions.
  • Working knowledge of Python, Golang, and Helm templating languages.
  • Node.js experience a plus, including running scalable, resilient Node microservices.
  • Grasp of foundational security best practices for cloud infrastructure.
  • Awareness of Terragrunt, managing Terraform state, and optimal project structure.
  • Seasoned in production readiness fundamentals amidst a fast moving team.

Responsibilities

  • Automate provisioning and deployments with Terraform and integrate best-practice pipelines (GitHub Actions, ArgoCD, etc.).
  • Define SLIs/SLOs, manage error budgets, and build dashboards & alerts to proactively measure and improve system health.
  • Enforce least-privilege IAM policies, automate vulnerability scans, and maintain audit logging for compliance.
  • Instrument services with metrics, logs, and distributed tracing to enable rapid troubleshooting, aid teams in alerting, custom metrics, and dashboarding.
  • Own on-call rotations, lead real-time incident response, conduct post-mortems, and drive continuous improvements.
  • Implement tagging strategies, right-size resources, and leverage concrete data to decide on optimal methods to control cloud spend at scale.
  • Author runbooks, standards, and best-practice guides-and coach dev teams on implementing modern DevOps, reliability, and security patterns.

Benefits

  • Avenue Code discloses salary range information based on our commitment to fairness and transparency.
  • A reasonable estimate of the current range for a Sr. SRE is from $160,000.00 to $166,000.00 yearly.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Career Level

Senior

Industry

Professional, Scientific, and Technical Services

Education Level

Bachelor's degree

Number of Employees

101-250 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service