About The Position

We are investing in our cloud platform as a core product and in reliability as a competitive advantage. As we scale, we need a senior engineering leader who can own our cloud platform foundations end-to-end, spanning reliability, infrastructure, and the software systems that power our control plane.

This role leads two closely related teams:

  • SRE / Infrastructure: reliability, scalability, observability, and operational excellence
  • Cloud Control Plane: backend and platform software engineers building APIs, distributed services, and systems that power our multi-tenant cloud offering

This is a hands-on, player-coach role with meaningful technical influence and people leadership.

Requirements

  • 5+ years in senior technical leadership roles (SRE, Infrastructure, Platform, or Cloud Engineering), including at least 2 years of people management experience, ideally across more than one team
  • Track record of balancing hands-on technical leadership with people leadership
  • Strong grasp of SRE fundamentals: SLOs/SLIs, error budgets, incident management, capacity planning, and operational excellence
  • Extensive experience with AWS, GCP, and Azure managed services
  • Strong backend engineering fundamentals and experience building and operating distributed systems in production
  • Hands-on experience with Kubernetes, Kubernetes Operators/Controllers, containerized workloads, and Infrastructure as Code (Terraform, Pulumi)
  • Excellent communication: can translate reliability tradeoffs to product/exec stakeholders and write crisp incident/postmortem artifacts
  • Proven ability to translate operational pain points into engineering deliverables

Nice To Haves

  • Experience working with or integrating AI-powered systems or tooling
  • Experience operating multi-tenant or high-isolation customer environments
  • Familiarity with distributed databases and performance tuning at scale
  • Experience building internal developer platforms or paved paths

Responsibilities

  • Manage and grow two teams (up to 10 engineers) across SRE and Cloud Platform
  • Coach senior engineers through technical ambiguity and design tradeoffs
  • Recruit, hire, onboard, and develop engineers while elevating the overall strength of the team
  • Own the technical direction for infrastructure, reliability, and cloud platform foundations
  • Partner with product and engineering leaders to shape the roadmap
  • Guide project planning by defining milestones, identifying dependencies, and working toward timely and meaningful delivery
  • Ensure the systems and platforms are operable, debuggable, and resilient by design
  • Participate in on-call rotations at a sustainable level to stay grounded in real operational issues

Benefits

  • Opportunity to work with cutting-edge technology in a rapidly growing sector
  • A supportive environment where your ideas lead to real impact
  • Competitive salary based on experience
  • Stock options at an early-stage startup
  • Comprehensive benefits including healthcare (US-based) and other insurance
  • Fully remote work with a flexible schedule to accommodate different time zones
  • Twice-yearly travel for team offsites focused on team bonding, collaboration, and having fun!
© 2024 Teal Labs, Inc