Staff Software Engineer, Infrastructure

Docker
CA$238,250 - CA$382,250Remote

About The Position

Docker is seeking a Staff Software Engineer, Infrastructure to join their globally distributed, remote-first team. This role is crucial for building and enhancing the platform that supports hundreds of engineers and high-scale production traffic. The primary focus will be on transforming expert-driven support into self-service systems with clear ownership, safe defaults, and strong guardrails. The goal is to create a platform that teams can trust and rely on, allowing them to focus on their own products. This year's roadmap includes significant improvements to infrastructure, such as reducing the time it takes to spin up new global regions or application environments from days to hours. This involves building foundations for multi-region, cross-account network architecture, and a robust testing and continuous-deployment flow. The successful candidate will join a growing team of four, expanding to seven, and will be responsible for setting technical direction and driving production adoption.

Requirements

  • 8+ years of professional, hands-on, full-time software engineering experience in backend, infrastructure, or platform engineering.
  • Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience
  • Strong software engineering in Go or a similar language: design, testing, debugging, review, long-term maintainability.
  • A track record designing, shipping, and operating cloud services or infrastructure platforms in production.
  • Deep expertise in at least one of: Kubernetes, networking, cloud platforms, reliability engineering, or developer platforms, plus solid Linux, networking, and production-ops fundamentals.
  • Experience setting technical direction and leading work that needs cross-team alignment.
  • Clear written and verbal communication in a remote environment (RFCs, design docs, incident writeups).

Nice To Haves

  • EKS and ingress/CNI/service-mesh experience
  • Observability with OpenTelemetry/Prometheus/Grafana
  • CI/CD and progressive delivery (GitHub Actions, Argo CD, canaries)
  • Experience leading migrations or adoption programs across teams.

Responsibilities

  • Take ambiguous infrastructure problems and turn them into proposals the org can rally around, then drive them through RFCs and architecture reviews across teams.
  • Design self-service capabilities and platform APIs (primarily in Go) for onboarding, provisioning, deployment, observability defaults, and day-2 operations, with contracts and docs teams actually use.
  • Set delivery standards using Terraform, GitOps with Argo CD, progressive rollout, and good testing, including building the continuous-deployment flow we're missing today.
  • Evolve the multi-tenant EKS foundations toward better reliability, security, scale, and cost: Envoy Gateway ingress, traffic routing, and the multi-region, cross-account connectivity we need.
  • Improve SLOs, alerting, and incident follow-up on Grafana Cloud so production gets safer and less dependent on heroics.
  • Actively invest in AI-assisted and agentic workflows to cut operational toil, ensuring they stay safe, auditable, and human-reviewed.
  • Shape where AI-assisted operations earn their place and where they don't, with early targets including alert enrichment, incident context-gathering, runbook-assisted diagnosis and remediation recommendations, and onboarding and readiness assistants.
  • Join the on-call rotation after onboarding and shadowing.
  • Improve the health of on-call itself, with better alerts, stronger runbooks, less toil, and blameless postmortems aimed at prevention.

Benefits

  • Freedom & flexibility; fit your work around your life
  • Designated quarterly Whaleness Days plus end of year Whaleness break
  • Home office setup; we want you comfortable while you work
  • 16 weeks of paid Parental leave (after 6 months of employment)
  • Technology stipend equivalent to $100 USD net/month
  • PTO plan that encourages you to take time to do the things you enjoy
  • Training stipend for conferences, courses and classes
  • Equity; we are a growing start-up and want all employees to have a share in the success of the company
  • Docker Swag
  • Medical benefits, retirement and holidays vary by country
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service