Staff Software Engineer, Billing

DockerSeattle, WA
Remote

About The Position

Docker has been one of the most loved brands in developer tooling, trusted by more than 20 million monthly users and over 20 billion container image pulls. From solo founders to the world's largest companies, developers rely on Docker to build, share, and run their applications across our suite of products including Docker Desktop, Docker Hub, and Docker Scout. We are a globally distributed, remote-first team building the tools that define how software gets built and delivered. As AI agents redefine software development, Docker is at the center of that shift, providing the sandboxed environments, verified images, and secure infrastructure that make autonomous workflows trustworthy by default. We're building AI-native development practices into how this team works at a foundational level. That means infrastructure design needs to account for a new kind of collaborator: AI agents that generate, deploy, and operate software. The Staff Infrastructure Engineer on this team won't just keep systems running — they'll define what safe, observable, AI-assisted infrastructure operations look like in practice, and set the standard for how the broader engineering organization follows.

Requirements

  • 8+ years in platform, infrastructure, or SRE roles supporting production SaaS systems at scale
  • Deep AWS expertise: ECS or EKS, RDS (Postgres preferred), networking, IAM, cost management — you've operated these systems under real load and real incidents
  • Expert-level Terraform; you've designed reusable module patterns and set standards others follow
  • Experience building and owning observability stacks (Datadog, Grafana, or similar) at an organizational level — not just using them
  • Strong familiarity with CI/CD systems — Jenkins, GitHub Actions, or equivalent — including pipeline design and developer experience ownership
  • Kubernetes at an operational and architectural level
  • A track record of identifying systemic risks and driving improvements that span team or organizational boundaries
  • Security-first mindset: threat modeling, blast radius analysis, least-privilege by default, audit trails as a design requirement
  • Strong written English; at Staff level, written communication is how you scale your influence across teams

Nice To Haves

  • You don't wait for problems to be handed to you — you find them, frame them, and drive the solution.
  • You've operated at a scope where your decisions affected multiple teams or systems, and you know how to build consensus and move work forward without direct authority.
  • You've thought seriously about what infrastructure needs to look like when AI agents are generating and shipping code — safe deployment patterns, strong observability, clean rollback — and you want to help define that standard here.
  • Experience with billing, payments, or financial systems infrastructure is a meaningful plus.

Responsibilities

  • Own and evolve the infrastructure supporting Billing Platform services: compute, storage, networking, CI/CD, and observability
  • Design and maintain IaC (Terraform) for billing system infrastructure on AWS; set module patterns and standards for the team
  • Build and own observability systems — metrics, logging, alerting — with a focus on billing accuracy and payment reliability
  • Define deployment patterns and runbooks that work well in an AI-agent-assisted development workflow: clear rollback procedures, safe promotion gates, automated validation
  • Partner with software engineers on service design — bringing infrastructure constraints and operational requirements into the conversation before code is written
  • Identify systemic risks and drive improvements that span team or organizational boundaries
  • Lead incident response for billing system issues; own the on-call rotation and postmortem process
  • Mentor engineers across the team; your technical judgment should raise the floor for everyone

Benefits

  • Freedom & flexibility; fit your work around your life
  • Designated quarterly Whaleness Days plus end of year Whaleness break
  • Home office setup; we want you comfortable while you work
  • 16 weeks of paid Parental leave
  • Technology stipend equivalent to $100 net/month
  • PTO plan that encourages you to take time to do the things you enjoy
  • Training stipend for conferences, courses and classes
  • Equity; we are a growing start-up and want all employees to have a share in the success of the company
  • Docker Swag
  • Medical benefits, retirement and holidays vary by country
  • Remote-first culture, with offices in Seattle and Paris

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Senior

Education Level

No Education Listed

Number of Employees

251-500 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service