About The Position

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing, an era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.

This role sits at the intersection of CI/CD, cloud operations, and GPU-centric microservices, supporting systems that must be both highly reliable and continuously evolving. The Senior Manager, CI/CD and Cloud Operations plays a critical leadership role in shaping how Omniverse services are built, tested, deployed, and operated globally.

Requirements

  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field, or equivalent experience.
  • Typically 12+ years of overall experience in infrastructure engineering, cloud operations, CI/CD, or related domains, or equivalent experience.
  • 5+ years of people management and technical leadership experience, including leading managers or senior engineers.
  • Deep understanding of modern CI/CD architectures, automated testing strategies, and delivery pipelines.
  • Strong experience with containerized environments and orchestration platforms such as Docker and Kubernetes.
  • Proven background operating cloud infrastructure at scale, including networking, storage, security, and control-plane services.
  • Ability to drive cross-team initiatives from build through execution and sustained operations.
  • Clear communication and leadership skills, with a track record of mentoring engineers and building high-performing teams.

Nice To Haves

  • Experience supporting GPU-intensive or high-performance computing workloads in production environments.
  • Familiarity with large-scale microservice platforms serving global user bases.
  • Background working closely with SRE organizations on reliability, observability, and incident management practices.
  • Experience introducing or modernizing CI/CD platforms in fast-growing or highly complex engineering organizations.

Responsibilities

  • Lead multiple teams responsible for CI/CD platforms and cloud operations that support GPU-accelerated, microservice-based Omniverse workloads.
  • Drive the build, evolution, and operation of end-to-end CI/CD systems, including build pipelines, automated testing, deployment workflows, and release management.
  • Coordinate cloud infrastructure used by Omniverse services, spanning compute, networking, storage, security, and control-plane components at global scale.
  • Partner closely with software engineering, SRE, and product teams to align development workflows, operational requirements, and delivery timelines.
  • Guide the development of tooling and services that improve rollout safety, observability, monitoring, and lifecycle management of distributed systems.
  • Evaluate new technologies in CI/CD, cloud platforms, and GPU compute, introducing improvements that raise reliability, efficiency, and developer productivity.
  • Establish strong operational practices, including incident readiness, postmortems, and continuous improvement across owned systems.

Benefits

  • NVIDIA offers highly competitive salaries and a comprehensive benefits package.
  • Equity