Staff Engineer, Site Reliability

Babylist
$226,673 - $271,991Remote

About The Position

Babylist's Platform team is the foundation every engineering team builds on — and this role is at the center of keeping it reliable, fast, and scalable. As a Staff SRE, you'll own the infrastructure and reliability practices that support 9 million+ users and the engineers who build for them. Babylist started as an e-commerce and registry platform, and we're actively growing beyond that — into health, media, mobile, and new product surfaces that don't exist yet. The Platform team is the foundation that makes all of it possible. This isn't a maintenance role — you'll be actively evolving how we build and operate AWS infrastructure, CI systems, and developer tooling. You'll work cross-functionally across all of Babylist Engineering, which means your decisions have wide leverage.

Requirements

  • Deep hands-on Terraform expertise — you own IaC, not just contribute to it
  • Proven AWS experience at scale — EKS, RDS, cloud networking, DNS, CDNs, load balancers — you know the gotchas
  • Experienced operating Kubernetes in production — you've debugged the hard stuff, not just deployed the easy stuff
  • Comfortable designing and improving CI/CD systems — CircleCI, GitHub Actions, or similar; you care about developer velocity, not just pipeline uptime
  • Strong observability instincts — Datadog, Sentry, PagerDuty, Cronitor — you build alerting that's actionable, not noisy
  • Experienced with on-call and incident management — you've run the post-mortems and actually changed things afterward
  • Comfortable supporting developers across local, staging, and production — you're a resource, not a gatekeeper
  • You naturally reach for AI in your work — at Babylist, every team uses AI daily. You're already using it to move faster and improve your output, and you stay curious about what's coming next.

Responsibilities

  • Manage and evolve our AWS environment using Terraform, keeping EKS clusters, databases, and core services current and performant
  • Own the speed and reliability of our CI systems for the full Engineering org — every deploy starts here
  • Be the person engineers turn to when environments break; unblock them fast across local, staging, and production
  • Establish and socialize best practices for monitoring & alerting so the right people get paged for the right reasons
  • Lead or support incident response, drive post-incident reviews, and close the loop so the same thing doesn't happen twice
  • Contribute to architectural decisions that shape how Babylist's infrastructure evolves over the next several years

Benefits

  • Competitive salary with equity and bonus opportunities
  • Company-paid medical, dental, and vision insurance
  • Retirement savings plan with company matching and flexible spending accounts
  • Generous paid parental leave and PTO
  • Remote work stipend to set up your office
  • Perks for physical, mental, and emotional health, parenting, childcare, and financial planning
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service