About The Position

The infrastructure powering AWS's next-generation AI services does not build itself. Behind every large-scale machine learning workload is a physical network: racks of compute, miles of fiber, and a precise sequence of construction, deployment, and validation events that have to land on time and in the right order. The AMER AWS Brick and Network Deployment (BaND) ML team owns Machine Learning DC Network Delivery across the US for customized DC’s built for ML. We are looking for a Technical Program Manager who wants to be at the center of it. You will own network delivery end-to-end across a portfolio of US-wide data center builds, driving the full network lifecycle from early construction coordination through network general services, optical turnup, and production fabric readiness. You will work directly with network engineers, data center construction teams, optical design, and supply chain, cutting through ambiguity, resolving blockers, and keeping complex multi-stream programs moving in parallel. Candidates who arrive with fluency in hyperscale network architecture, structured cabling systems, and data center construction sequencing will be effective from the first week. The work is technical, fast-paced, and consequential. What you will own: • End-to-end network delivery across all service layers for US-wide builds: production fabric (10p10u ML brick rack deployments including top-of-rack and spine cabling), optical (border, edge, backbone, and internet circuit turnup), and general services (Corp, security, WiFi, management console) • Rack deployment sequencing and cabling execution: you understand structured cabling plant, tray fill, fiber termination workflows, cut sheet coordination, and what it takes to move from empty cage to powered and patched on a hyperscale floor • Optical and circuit readiness: familiarity with DWDM, dark fiber provisioning, circuit activation workflows, and the handoff sequence between optical design, field teams, and network engineering • Construction phase coordination: you know how MEP sequencing, raised floor or overhead cable routing, and power-on sequencing affect network deployment timelines • Cross-org coordination across Network Engineering, Cabling, Construction, and supply chain to maintain critical path alignment and surface dependencies before they become blockers • Milestone tracking and readiness reporting tied to key program gates, with clear escalation to senior leadership when commitments are at risk • Escalation ownership: you identify the blocker, frame the ask, and drive resolution with the right stakeholders at the right level What you bring: • 3+ years of technical program management experience in hyperscale infrastructure, network deployment, or data center construction environments • Hands-on familiarity with large-scale structured cabling systems: you can read a cut sheet, understand tray and conduit routing, and have a working mental model of how fiber moves through a data center building • Working knowledge of data center network architecture at scale: CLOS topologies, spine-leaf fabric design, ToR switching, and how production ML fabrics differ from standard compute deployments • Experience coordinating across construction, facilities, and network engineering in a live build environment, including managing the interface between construction milestones and network deployment readiness • Demonstrated ability to manage multiple concurrent workstreams across organizational boundaries without losing precision on any of them • Comfort going deep on technical dependencies while communicating clearly at the leadership level: you can translate a fiber tray congestion issue into a program risk statement for Director level audience without losing accuracy • A bias for action and a track record of moving programs forward in ambiguous, fast-moving environments • Strong written and verbal communication: you write clearly, escalate with precision, and do not bury the lead Why this role stands out: The BaND ML team operates at the intersection of physical construction and hyperscale network delivery at a scale very few organizations reach. You will manage concurrent builds across multiple US regions, own real program outcomes that directly affect AWS's AI capacity roadmap, and develop deep cross-functional relationships across some of the most technically demanding infrastructure teams in the industry. If you have been looking for a role where your data center and network domain knowledge is a genuine competitive advantage from day one, this is it.

Requirements

  • 3+ years of technical program management experience in hyperscale infrastructure, network deployment, or data center construction environments
  • Hands-on familiarity with large-scale structured cabling systems: you can read a cut sheet, understand tray and conduit routing, and have a working mental model of how fiber moves through a data center building
  • Working knowledge of data center network architecture at scale: CLOS topologies, spine-leaf fabric design, ToR switching, and how production ML fabrics differ from standard compute deployments
  • Experience coordinating across construction, facilities, and network engineering in a live build environment, including managing the interface between construction milestones and network deployment readiness
  • Demonstrated ability to manage multiple concurrent workstreams across organizational boundaries without losing precision on any of them
  • Comfort going deep on technical dependencies while communicating clearly at the leadership level: you can translate a fiber tray congestion issue into a program risk statement for Director level audience without losing accuracy
  • A bias for action and a track record of moving programs forward in ambiguous, fast-moving environments
  • Strong written and verbal communication: you write clearly, escalate with precision, and do not bury the lead
  • 3+ years of technical product or program management experience
  • 2+ years of software development experience
  • 3+ years of project management disciplines including scope, schedule, budget, quality, along with risk and critical path management experience
  • Experience managing programs across cross functional teams, building processes and coordinating release schedules

Nice To Haves

  • 3+ years of working directly with engineering teams experience

Responsibilities

  • Monitor the network health, customer demand, anticipate risks, resolve issues and initiate corrective action as appropriate.
  • Work with stakeholders (networking engineers/tech program managers/senior management) to define requirements and processes.
  • Manage the dependencies and the interfaces between projects and negotiate the trade-offs needed.
  • Provide program progress reporting on a regular basis.
  • End to end responsibility for the program’s execution and success.
  • Manage multiple competing projects/programs simultaneously.
  • Develop and implement scalable, data-driven process improvement/automated solutions.
  • Keep the customer experience as the primary focus.
  • Meets/exceeds Amazon’s leadership principles requirements for this role

Benefits

  • health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage)
  • 401(k) matching
  • paid time off
  • parental leave
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service