Senior Technical Program Manager, DGX Cloud Software Products and Service

NVIDIA | Santa Clara, CA
$200,000 - $322,000 | Hybrid

About The Position

The DGX Cloud team is looking for a Senior Technical Program Manager (TPM) to guide complex, cross-functional projects that support NVIDIA’s next-generation AI infrastructure. This position involves leading software initiatives across cloud platforms, infrastructure services, and distributed systems. The role focuses heavily on cloud-native software delivery, Kubernetes-based platforms, and large-scale AI workloads. You will be responsible for managing high-impact engineering programs within a dynamic, fast-paced roadmap, aligning priorities across teams, and ensuring timely, high-quality delivery. This role requires strong technical skills, a proactive approach, and the ability to operate effectively across multiple levels of the organization. We are specifically looking for a software TPM with strong Kubernetes experience who can help drive execution across platform software and cloud infrastructure.

Requirements

  • Postgraduate degree in Computer Science, Artificial Intelligence, or equivalent experience.
  • 12+ years of program management experience, including a proven ability to manage global projects across multiple time zones.
  • Solid knowledge of cloud-native software systems, Kubernetes, containerized applications, microservices architectures, and infrastructure-as-a-service (IaaS) platforms.
  • Practical experience working with Kubernetes is required.
  • Proven experience driving large-scale software programs in fast-paced engineering environments.
  • Strong understanding of software engineering guidelines, release procedures, system integration, and platform delivery.
  • Proven experience creatively resolving technical issues and resource conflicts.
  • Detail-oriented, with a proven ability to multitask in a dynamic environment with shifting priorities and changing requirements.
  • Direct experience working within a dynamic software development environment is essential.
  • Excellent communication and technical presentation skills.
  • Significant experience with large-scale Agile tools, reporting, and processes relevant to this role is required.
  • Demonstrated skill in initiating and facilitating productive engagements with engineering, operations, and product teams.

Nice To Haves

  • Strong background in Machine Learning, Deep Learning, and Artificial Intelligence applications.
  • Prior experience leading programs for Kubernetes platforms, cloud-native infrastructure, platform services, or developer platforms.
  • Experience with software release management, service operationalization, and large-scale platform adoption.
  • Familiarity with observability, CI/CD, infrastructure automation, and service reliability practices in cloud environments.
  • Consistent track record of driving process improvements and measuring efficiency.
  • Familiarity with NVIDIA platforms, products, and ecosystem is a plus.

Responsibilities

  • Lead the complete implementation of DGX Cloud software initiatives, encompassing planning, management, delivery, and operationalization across NVIDIA’s cloud infrastructure.
  • Partner with software, infrastructure, product, and platform engineering teams to align on goals, architecture milestones, deliverables, and schedules.
  • Lead initiatives involving Kubernetes-based platforms, cloud-native services, platform APIs, and distributed systems that enable AI training and inference workloads.
  • Define and implement scalable program management processes, tools, and guidelines to ensure high execution velocity and program transparency.
  • Identify cross-functional dependencies, mitigate risks, and drive resolution of complex technical and programmatic issues across the software stack.
  • Establish clear success metrics and reporting mechanisms to track progress and communicate status to senior leadership.
  • Foster a culture of collaboration and continuous improvement across engineering, product, and operations teams.
  • Develop and implement metrics for assessing program efficiency and identifying areas for improvement; collect and analyze data to support planning and data-driven decisions.
  • Report on overall program status, providing insights and recommendations to senior management.
  • Drive organizational alignment and efficiency by coordinating with multi-functional leads and streamlining processes across software development lifecycles and release execution.

Benefits

  • Competitive salaries
  • Generous benefits package
  • Equity