Token-as-a-Service Technical Program Manager

OpenAISan Francisco, CA
Hybrid

About The Position

OpenAI’s Stargate and 3P Engineering teams are responsible for building and scaling the external infrastructure ecosystem that powers advanced AI systems. We work across hyperscalers, colocation providers, cloud partners, and strategic third-party operators to turn contracted capacity into production-ready compute. Our scope spans the full lifecycle of external deployments: commercial alignment, technical readiness, network integration, hardware enablement, operational readiness, and long-range scaling strategy. As OpenAI’s infrastructure footprint expands globally, we need leaders who can convert complex partner environments into reliable, high-velocity capacity for training and inference workloads. We are seeking a Technical Program Manager, Token-as-a-Service (TaaS) to lead delivery of external compute capacity that directly serves OpenAI model workloads. In this role, you will own complex cross-functional programs that transform third-party infrastructure into usable tokens at scale. You will partner across engineering, capacity planning, networking, hardware, finance, product, and external providers to ensure that deployed capacity translates into real production throughput. This role sits at the intersection of infrastructure execution, systems readiness, and business impact. Success requires strong technical fluency, elite program management, and the ability to drive accountability across internal teams and external partners. This is a high-visibility role with direct impact on OpenAI’s ability to scale model training and inference globally.

Requirements

  • 8+ years of Technical Program Management, Engineering Program Management, or Infrastructure Delivery experience.
  • Experience leading large-scale technical programs involving cloud, data center, networking, hardware, or distributed systems.
  • Strong understanding of compute infrastructure, clusters, networking, storage, and production systems.
  • Proven ability to drive cross-functional execution across engineering, operations, finance, and external vendors.
  • Experience managing executive stakeholders and communicating complex tradeoffs clearly.
  • Strong analytical skills with ability to reason about utilization, throughput, capacity, and operational metrics.
  • Comfortable operating in ambiguous, fast-scaling environments.
  • Strong written and verbal communication skills.
  • High ownership mentality with bias toward action.

Nice To Haves

  • Experience working with external providers, strategic partners, or hyperscalers is highly preferred.
  • Experience with GPU clusters, AI infrastructure, or large-scale model serving environments.
  • Familiarity with token economics, inference capacity planning, or workload scheduling.
  • Experience scaling global infrastructure through third-party providers.
  • Background in systems engineering, networking, or hardware deployment programs.
  • Experience building new operational models in high-growth environments.

Responsibilities

  • Lead end-to-end delivery programs that convert external infrastructure capacity into production-ready token supply.
  • Own readiness across compute, storage, networking, security, and operational dependencies for third-party environments.
  • Build integrated plans across internal engineering teams and external partners with clear milestones, owners, risks, and critical paths.
  • Drive launch execution for new partner regions, clusters, and capacity expansions.
  • Create operating mechanisms that measure deployed capacity versus usable token output.
  • Identify bottlenecks preventing token generation (network constraints, hardware readiness, software enablement, partner delays, etc.) and drive resolution.
  • Coordinate with capacity planning and finance teams to prioritize the highest ROI capacity opportunities.
  • Establish executive-level reporting on delivery status, risks, and token ramp forecasts.
  • Improve repeatability of partner onboarding, technical integration, and scaling motions.
  • Manage escalations across internal and external stakeholders during high-severity delivery issues.
  • Translate ambiguous infrastructure constraints into clear execution plans.
  • Help define the long-term operating model for Token-as-a-Service across Stargate and 3P ecosystems.

Benefits

  • Relocation assistance is available.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Senior

Education Level

No Education Listed

Number of Employees

1-10 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service