About The Position

NVIDIA’s DGX Cloud is redefining how organizations deploy and scale AI infrastructure. We’re looking for a Senior Technical Program Manager to drive storage-related initiatives across development, operations, and cloud deployment. This is a high-impact role interfacing with engineering, product, operations, finance, and our global cloud partners. What you'll be doing: As a DGX Cloud Storage Technical Program Manager, you will be the connective tissue

Requirements

  • 12+ years of experience in program management of large-scale software or infrastructure projects
  • MS EE or CS degree, or equivalent experience
  • Proven success driving programs across global, distributed teams.
  • Outstanding communication and organizational skills, with the ability to align cross-org stakeholders.
  • Expertise with tools like Jira and Confluence, and the ability to guide teams in their use.
  • Strong foundation in software development, Agile methodologies, and DevOps best practices.
  • Familiarity with Cloud Platforms: AWS, Azure, GCP, or OCI storage services (Block, Object, File)
  • Knowledge of Distributed Storage Systems: SAN, NAS, object storage, and scalable distributed architectures such as Ceph or Lustre.
  • Storage Performance: Understanding IOPS, latency, throughput optimization, and capacity planning for large-scale environments
  • Data Protection & DR: Familiarity with snapshots, backups, replication, and disaster recovery strategies
  • AI/ML & HPC Workloads: Understanding storage requirements for high-throughput AI training or data pipelines

Nice To Haves

  • Hands-on experience with storage operations, provisioning, performance monitoring, and troubleshooting.
  • Experience with new product introduction and program managing research teams.

Responsibilities

  • Lead cross-functional storage programs from requirements gathering through execution and delivery.
  • Drive alignment across NVIDIA storage engineering, operations, cloud service providers, clusters operators, resource governance and finance.
  • Define project plans, schedules, and achievements for storage features, storage deployments, support, security, compliance, and observability.
  • Partner with the engineering team and product management to define and deliver products roadmap.
  • Manage technical risks and resolve blockers that impact quality, scope, and delivery timelines.
  • Coordinate with cross-functional teams to improve workflows, efficiency, and transparency.
  • Ensure program visibility across the organization and maintain strong communication channels with senior stakeholders.
  • Improve organizational efficiency by collaborating with multi-functional leads and optimizing processes
  • Cultivate a culture of continuous improvement, finding opportunities for process enhancements

Benefits

  • competitive salaries
  • generous benefits package
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service