Senior DevOps Engineer

NVIDIASanta Clara, CA
1d

About The Position

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world. Omniverse is NVIDIA’s scalable real‑time 3D application development platform. It integrates technologies like OpenUSD, RTX‑accelerated rendering, physics, robotics, and digital twins to enable simulation and collaboration at scale. NVIDIA’s Developer Infrastructure team builds and maintains the critical R&D environments as well as the CI/CD pipelines and orchestration systems upon which Omniverse developer workflows depend.

Requirements

  • Bachelor’s in Computer Science, Software Engineering, or equivalent experience.
  • 8+ years of professional experience in DevOps, SRE, or Build/Release engineering roles at similar scale.
  • Proficient in Python for scripting, tooling, and automation.
  • Strong experience with at least one of the major cloud providers: Amazon AWS, Microsoft Azure, or Google Cloud Platform—including orchestration and infrastructure automation.
  • Deep hands‑on experience in CI/CD, virtualization, and container orchestration.
  • Usage of tools like GitLab CI/CD, Jenkins, CircleCI, Docker, Kubernetes is required.
  • Working familiarity with GitHub or GitLab version‑control and CI/CD workflows.
  • Solid understanding of Linux (and/or Windows) environments and networking.

Nice To Haves

  • Familiarity with GPU‑accelerated systems, CUDA, or NVIDIA DGX / GPU cluster infrastructure!
  • Exposure to AI inference workloads, robotics simulation (Isaac), or high‑fidelity 3D graphics pipelines!
  • Background in infrastructure monitoring, build/test pipeline optimization, or platform engineering.

Responsibilities

  • Build and operate highly reliable, scalable CI/CD pipelines based on GitLab and Github, as well as maintain developer R&D environments using virtualized GPU desktop instances for Omniverse and Isaac development environments, across production, staging, and dev systems
  • Support orchestration of GPU‑enabled clusters running Linux and Windows workloads using containerization tools (Docker, Kubernetes) and virtualization (VMware, KVM).
  • Automate infrastructure provisioning, scaling, and configuration using Infrastructure‑as‑Code (Terraform, Ansible, AWS CDK, Azure Bicep, etc.).
  • Monitor and optimize system health, performance, build/test throughput, and pipeline reliability.
  • Collaborate closely with Omniverse and Isaac developer teams to smooth release workflows, increase velocity, and code quality.
  • Mentor and guide junior team members; lead by example in DevOps protocols.

Benefits

  • NVIDIA offers highly competitive salaries and a comprehensive benefits package.
  • As you plan your future, see what we can offer to you and your family www.nvidiabenefits.com/
  • You will also be eligible for equity and benefits.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service