Senior Technical Program Manager, Cloud Infrastructure NPI

NVIDIASanta Clara, CA
$168,000 - $322,000

About The Position

NVIDIA’s deep learning platforms have made a major impact in various fields and are broadly used across leading academic institutions, start-ups, and industry, including the world’s largest Internet companies. We are seeking an experienced and talented technical program manager for NVIDIA's DGX Cloud. We need passionate, hard-working, and creative people to help us deliver value to DGX Cloud customers. As a DGX Cloud Technical Program Manager, you'll be a key partner to our Engineering, Infrastructure, and Software teams, driving critical cloud infrastructure programs across DGX Cloud. You'll play a pivotal role in maturing how we bring up AI capacity — strengthening process resiliency, driving automation into our PLC and acceptance workflows, and enabling early access to upcoming NVIDIA platforms. This is a dynamic, fast-paced environment where TPMs are motivated to apply fungible abilities to a range of high-impact programs.

Requirements

  • 12+ years of technical program management experience, with a focus on infrastructure, hardware/software integration, or cloud platforms
  • Success in leading NPI or large cross-functional programs in fast-paced environments
  • Experience working with cloud service providers, large-scale data center deployments, or enterprise-scale infrastructure programs
  • Strong understanding of GPU compute, Kubernetes, CI/CD pipelines, and cloud-native services
  • Demonstrated experience building or improving product development processes and team workflows
  • Skilled in tools such as JIRA, Confluence, dashboards, and reporting tools
  • Ability to influence cross-functional teams, including HW, SW, QA, Site Ops, and Product
  • Outstanding communication and leadership skills, capable of collaborating effectively with senior collaborators
  • BS/MS in CS, EE, related technical field, or equivalent experience

Nice To Haves

  • Experience in launching cloud infrastructure products or large-scale hardware-software systems
  • Previous involvement in New Product Introduction (NPI), including platform bring-up and validation
  • Familiarity with AI infrastructure, or GPU-based cloud platforms
  • Experience with process automation, observability (telemetry/metrics), and health check frameworks
  • Passion for building repeatable systems, tools, and cross-org efficiency at scale

Responsibilities

  • Lead the end-to-end execution of NPI programs across engineering, operations, and cloud service provider (CSP) partners
  • Lead the DGX Cloud NPI Early Access Program — Enabling processes that leading to engineering teams across DGXC getting early access to critical Nvidia systems (i.e. VR / VR Ultra) in order to develop critical software and automation.
  • Drive PLC process for capacity bring-up into system-based, automated solutions — taking a baseline PLC type process and driving iterative improvements and system approaches to codifying in tooling including jira.
  • Coordinate site readiness and infrastructure bring-up activities, including networking, inventory, corp IT, and security integration
  • Partner with SW stack teams to track development, testing, and integration across product phases
  • Define and implement acceptance testing, validation workflows, and readiness gates for new platforms
  • Work closely with stakeholders to develop scalable NPI processes, tools, and dashboards
  • Drive automation efforts for break/fix workflows, telemetry enablement, and system health validation
  • Facilitate regular communication with leadership, engineering, CSP teams, and Colo partners and cultivate a culture of continuous improvement and process innovation

Benefits

  • highly competitive salaries
  • comprehensive benefits package
  • equity
  • benefits
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service