Senior Site Reliability Engineer

RoNew York, NY
Hybrid

About The Position

At Ro, our mission is to provide world-class healthcare by putting patients first - and that mission depends on reliable, secure, and scalable systems. As a Senior SRE on the infrastructure team, you’ll sit at the core of that effort: contributing to the reliability of our production systems, hardening infrastructure and building tools that empower our engineers to ship safely and confidently. You will work across teams to drive uptime, performance and observability – partnering closely with product, platform and security engineers. From designing resilient systems to shaping incident response practices, this is a role for engineers who thrive on impact and care deeply about operational excellence.

Requirements

  • Strong understanding of systems and infrastructure, with experience operating distributed services in production. We are mostly in AWS and leverage a lot of its primitives - EKS, RDS, Route53, S3, Elasticache to name a few
  • Strong programming and automation skills using Go or Python
  • Proficiency with infrastructure as code - Terraform / Pulumi
  • A passion for observability, with hands-on experience in metrics, logging, tracing using Datadog
  • Solid cross-functional communication, able to collaborate with product, platform, security and other teams
  • An operational mindset that puts reliability and resilience as a core product requirement
  • A mission-driven attitude, motivated by the opportunity to make healthcare better.

Responsibilities

  • Design and implement resilient infrastructure to support high availability at scale
  • Build and contribute to tools and platforms that streamline deployment, monitoring and recovery of systems
  • Drive incident response and harness learnings, leading efforts to minimize downtime and improve MTTR
  • Partner with engineering teams to bake best practices for reliability, resilience and observability into services
  • Automate infrastructure workflows using IaC and other cloud native tools
  • Contribute to our culture of operational excellence, guiding engineers through reliability practices and raising the bar across the engineering org

Benefits

  • Full medical, dental, and vision insurance + OneMedical membership
  • Healthcare and Dependent Care FSA
  • 401(k) with company match
  • Flexible PTO
  • Wellbeing + Learning & Growth reimbursements
  • Paid parental leave + Fertility benefits
  • Pet insurance
  • Student loan refinancing
  • Virtual resources for mindfulness, counseling, and fitness
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service