Systems Development Manager, Region Reliability

AmazonBellevue, WA
Onsite

About The Position

The Amazon Dedicated Cloud (ADC) Region Reliability team is searching for a talented, detail-oriented Systems Development Manager (SysDM) to support AWS activities in US ADC regions supporting the Department of Defense and Intelligence Community. A SysDM will work to solve service-centric technical and business problems, devise strategy, and facilitate technology solutions that may not yet be defined. They will deliver independently, with limited guidance, across cross-functional initiatives. Additionally, individuals must act as force multipliers, understand escalation, and find a path forward in difficult situations. The role also necessitates understanding security risks caused by technical complexity, and being able to make trade-offs between short-term and long-term needs. From Day 1, SysDMs are given ownership of a distributed team of engineers responsible for the operational health services in secure, air-gapped cloud environments. You'll own the team's roadmap, from hiring and developing talent, to defining automation strategy, to ensuring your services meet or exceed their availability and performance targets. You won't wait for direction; you'll assess the landscape, identify the highest-impact opportunities, and start driving results. On a day-to-day basis, you will lead your team through a mix of operational and strategic work. That means triaging and resolving service issues, driving root cause analysis to prevent repeat incidents, and building the dashboards and metrics that keep your team ahead of problems rather than reacting to them. You'll partner with service teams and program managers to support new region builds and feature launches, ensuring operational readiness before go-live. You'll also spend meaningful time on your people, coaching engineers, removing blockers, shaping career paths, and building a team culture grounded in ownership and continuous improvement.

Requirements

  • Bachelor's degree, or CSSLP (Certified Secure Software Lifecycle Professional)
  • 5+ years of relevant systems engineering, software development, or infrastructure engineering experience
  • 3+ years of experience managing engineering teams, including hiring, developing, and retaining talent
  • Experience operating in a 24x7 production environment supporting mission-critical services
  • Experience with Linux/Unix systems administration, networking, and infrastructure automation
  • Strong written and verbal communication skills
  • Current, active US Government Security Clearance of Top Secret with SCI eligibility or above

Nice To Haves

  • Experience managing engineering teams supporting cloud computing services at scale
  • Experience with support procedures and methodologies for production environments, including ticketing, monitoring, metrics, and SLA management
  • Experience improving systems through architecture, design patterns, reliability, and operational scaling
  • Strong analytical skills with the ability to use data and metrics to drive decisions and measure impact
  • Experience working across organizational boundaries, bringing together people with diverse perspectives to deliver results
  • Experience with Agile engineering practices (Kanban, Scrum, continuous delivery)
  • Familiarity with at least one modern programming or scripting language (Python, Java, Go, or similar)

Responsibilities

  • Lead, hire, develop, and retain a team of engineers supporting services in secure, air-gapped AWS ADC regions
  • Own your team's operational roadmap, working backwards from customer problems to define priorities, scope work, and deliver results
  • Drive operational excellence through automation, metrics, and process improvements that reduce manual toil and enable the team to scale
  • Triage and resolve service issues, lead root cause analysis, and implement preventative fixes so problems do not repeat
  • Partner with service teams, program managers, and cross-functional stakeholders to support new region builds, feature launches, and service parity across ADC regions
  • Identify and manage technical and operational risks, clearly communicating trade-offs and mitigation strategies to stakeholders
  • Track and report your team's progress using metrics that capture customer impact, SLA compliance, and system health

Benefits

  • sign-on payments
  • restricted stock units (RSUs)
  • health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage)
  • 401(k) matching
  • paid time off
  • parental leave
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service