Senior DevOps Engineer - Infrastructure

NVIDIASanta Clara, CA
$184,000 - $287,500

About The Position

We are seeking a highly skilled and experienced Senior DevOps Engineer to join our dynamic NVIDIA Robotics DevOps team. The ideal candidate will have a strong background in managing CI/CD infrastructure. You will work on many open-source and non-open-source applications and packages in the robotics field within a small yet highly efficient team. This role will involve ownership of the infrastructure setup and automation, collaborating with other teams, and ensuring the reliability, scalability, and efficiency of our hardware.

Requirements

  • Bachelor’s or Master’s in CS, CE, EE, or related field (or equivalent experience).
  • 8+ years in DevOps/SRE/infrastructure roles, including ownership of CI or lab environments; 3+ years in a senior capacity.
  • Practical experience with AWS or similar cloud platforms for CI or compute workloads.
  • Hands on experience with CI/CD systems (e.g., GitLab CI, GitHub Actions) and Git based workflows.
  • Strong Linux systems expertise (networking, storage, performance, security).
  • Proficiency in Python and shell for automation and tooling.
  • Hands on work with servers, embedded boards, networking gear, and remote management.

Nice To Haves

  • Experience with NVIDIA Tegra infrastructure.
  • Solid knowledge of containers and orchestration (Docker, Kubernetes).
  • Proven track record of driving infrastructure reliability improvements and cross team projects.

Responsibilities

  • Manage CI runners/executors and capacity across on prem and cloud environments.
  • Use infrastructure as code to provision and update CI environments and supporting services.
  • Deploy and extend monitoring, logging, and alerting for CI, GPU servers, Tegra boards, and lab services (e.g., Prometheus, Grafana, ELK style stacks).
  • Operate Tegra/Jetson testbeds used by CI and developers: provisioning, flashing, OS/JetPack updates, recovery, and reservation/scheduling.
  • Diagnose and resolve issues spanning power, networking, OS, containers, CI agents, and test infrastructure.

Benefits

  • equity
  • benefits
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service