About The Position

NVIDIA's DGX Cloud is redefining how organizations deploy and scale AI infrastructure. We're looking for a Senior Technical Program Manager to drive storage-related initiatives across development, operations, and cloud deployment. This is a high-impact role interfacing with engineering, product, operations, finance, and our global cloud partners. What you'll be doing: As a DGX Cloud Storage Technical Program Manager, you will be the connective tissue Lead cross-functional storage programs from requirements gathering through execution and delivery. Drive alignment across NVIDIA storage engineering, operations, cloud service providers, clusters operators, resource governance and finance. Define project plans, schedules, and achievements for storage features, storage deployments, support, security, compliance, and observability. Partner with the engineering team and product management to define and deliver products roadmap. Manage technical risks and resolve blockers that impact quality, scope, and delivery timelines. Coordinate with cross-functional teams to improve workflows, efficiency, and transparency. Ensure program visibility across the organization and maintain strong communication channels with senior stakeholders. Improve organizational efficiency by collaborating with multi-functional leads and optimizing processes Cultivate a culture of continuous improvement, finding opportunities for process enhancements

Requirements

  • 12+ years of experience in program management of large-scale software or infrastructure projects
  • MS EE or CS degree, or equivalent experience
  • Proven success driving programs across global, distributed teams.
  • Outstanding communication and organizational skills, with the ability to align cross-org stakeholders.
  • Expertise with tools like Jira and Confluence, and the ability to guide teams in their use.
  • Strong foundation in software development, Agile methodologies, and DevOps best practices.
  • Familiarity with Cloud Platforms: AWS, Azure, GCP, or OCI storage services (Block, Object, File)
  • Knowledge of Distributed Storage Systems: SAN, NAS, object storage, and scalable distributed architectures such as Ceph or Lustre.
  • Storage Performance: Understanding IOPS, latency, throughput optimization, and capacity planning for large-scale environments
  • Data Protection & DR: Familiarity with snapshots, backups, replication, and disaster recovery strategies
  • AI/ML & HPC Workloads: Understanding storage requirements for high-throughput AI training or data pipelines

Nice To Haves

  • Hands-on experience with storage operations, provisioning, performance monitoring, and troubleshooting.
  • Experience with new product introduction and program managing research teams.

Responsibilities

  • Lead cross-functional storage programs from requirements gathering through execution and delivery.
  • Drive alignment across NVIDIA storage engineering, operations, cloud service providers, clusters operators, resource governance and finance.
  • Define project plans, schedules, and achievements for storage features, storage deployments, support, security, compliance, and observability.
  • Partner with the engineering team and product management to define and deliver products roadmap.
  • Manage technical risks and resolve blockers that impact quality, scope, and delivery timelines.
  • Coordinate with cross-functional teams to improve workflows, efficiency, and transparency.
  • Ensure program visibility across the organization and maintain strong communication channels with senior stakeholders.
  • Improve organizational efficiency by collaborating with multi-functional leads and optimizing processes
  • Cultivate a culture of continuous improvement, finding opportunities for process enhancements

Benefits

  • competitive salaries
  • generous benefits package
  • equity
  • benefits
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service