About The Position

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world. NVIDIA builds and operates its own Kubernetes distribution and managed engine service (NKE) that powers GPU clusters across cloud environments and on-premises deployments. This is the infrastructure underneath everything — the control planes, the components, the upgrade paths, and the distribution that NVIDIA and our customers depend on to run AI workloads optimally. We need a product manager who can lead this layer end-to-end. In this role, you will define how we deliver Kubernetes as a service: control plane lifecycle, core component strategy, upstream alignment, distribution packaging, and the developer and operator tooling that makes clusters manageable at scale. This role requires deep Kubernetes infrastructure experience. Have you built or operated a Kubernetes service? Run a distribution? Led platform engineering for Kubernetes at scale? If so, this role is for you!

Requirements

  • Bachelor’s degree in Computer Science, Engineering, or a similar area, or equivalent experience.
  • 8+ years of product management experience in Kubernetes infrastructure, Kubernetes services, or platform engineering
  • Deep understanding of Kubernetes internals: control plane architecture, etcd, scheduling, networking, storage integration, and upgrade mechanics
  • Experience shipping a Kubernetes distribution, K8s service, or enterprise platform product
  • Track record of leading upstream open source alignment alongside production delivery constraints
  • Experience with on-premises and hybrid deployment models, not just public cloud

Nice To Haves

  • Building or operating EKS, AKS, GKE, OpenShift, Rancher, or similar K8s platforms
  • Hands-on experience with NVIDIA GPU infrastructure, DGX systems, or GPU-aware K8s scheduling
  • Shipping Kubernetes tooling used by operators in production (cluster management, diagnostics, lifecycle automation)
  • K8s conformance certification, CIS benchmarks, or security hardening for enterprise or government environments
  • Contributions to upstream Kubernetes or CNCF ecosystem projects

Responsibilities

  • Own the NKE product surface: control plane lifecycle management, API server availability, component upgrades, and cluster provisioning and teardown
  • Define our Kubernetes distribution strategy — packaging, conformance, version policy, and release cadence for NVIDIA-managed and on-premises environments
  • Drive upstream Kubernetes alignment: feature adoption, contribution strategy, and release tracking that keeps us current without introducing instability
  • Own developer and operator tooling for cluster management, diagnostics, and day-2 operations across environments
  • Define and publish tooling that enables on-premises customers and partners to deploy, run, and upgrade NVIDIA Kubernetes clusters independently
  • Drive service reliability, upgrade safety, and multi-tenant isolation at the provider layer

Benefits

  • NVIDIA offers highly competitive salaries and a comprehensive benefits package.
  • You will also be eligible for equity and benefits

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Number of Employees

5,001-10,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service