Lead SRE/DevOps Engineer

Launch PotatoPhiladelphia, NY
Remote

About The Position

Launch Potato is a profitable digital media company reaching over 30M+ monthly visitors. As The Discovery and Conversion Company, our mission is to connect consumers with the world’s leading brands through data-driven content and technology. Headquartered in South Florida with a remote-first team spanning over 15 countries, we’ve built a high-growth, high-performance culture where speed, ownership, and measurable impact drive success. This role involves owning and evolving Launch Potato's cloud infrastructure, CI/CD platform, and compliance posture. The goal is to build the SRE function from the ground up, enabling product teams to ship faster without compromising reliability, security, or cost control.

Requirements

  • 5+ years of production AWS infrastructure experience with deep Terraform expertise.
  • Hands-on experience building the SRE function from scratch and had complete ownership.
  • Experience with a multi-site company where PaaS or microservices are required.
  • CI/CD pipeline ownership in one or more previous roles.
  • PagerDuty experience and standing up an on-call rotation.
  • 5+ years hands-on with AWS, Terraform, CI/CD pipeline ownership, and SRE tooling (OpenTelemetry, Grafana, PagerDuty or equivalent) in a production environment.
  • Ownership orientation: Proactively identify and fix issues, create documentation.
  • Documentation discipline: Write runbooks, decision rationale, architecture patterns, incident post-mortems.
  • Cost consciousness: Understand and communicate the business impact of infrastructure decisions.
  • Calm under pressure: Ability to handle production incidents, triage clearly, communicate proactively, and run post-mortems.
  • Cross-functional communication: Ability to work with product engineers, legal/compliance, and executive leadership.

Responsibilities

  • Own and evolve Launch Potato's cloud infrastructure, CI/CD platform, and compliance posture.
  • Build the SRE function from the ground up.
  • Stand up the SRE practice from scratch, including on-call rotation, PagerDuty configuration, SLA/SLO definitions for core infrastructure services, runbook library, and observability dashboards.
  • Complete the AWS multi-account migration.
  • Deliver SOC 2 Type I audit-ready infrastructure evidence package.
  • Version and publish the Terraform module library to a private registry.
  • Implement automated deployment rollback for ECS and Lambda.
  • Stand up monthly cost reporting to leadership.

Benefits

  • Profit-sharing bonus
  • Competitive benefits
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service