About The Position

NVIDIA has become the platform upon which every new AI-powered application is built. From healthcare research applications to autonomous vehicles, or voice-recognition systems, the need for advanced perception and cognitive capabilities is exploding, and NVIDIA is right in the center of this revolution. We are seeking a motivated Senior Systems Software Engineer to join our Autonomous Vehicle Infrastructure organization! This role focusses on building, deploying, and operating validation platforms at scale. You will work with internal teams and external partners to integrate distributed systems, manage large-scale data pipelines, and operationalize next-generation validation workflows for autonomous driving. This role offers a chance to start from the ground up: standing up new vendor-provided platforms, validating integration paths, and ensuring infrastructure is reliable, secure, and production-ready. You will combine hands-on engineering, infrastructure deployment, and workflow automation to help scale our AV validation ecosystem.

Requirements

  • BS/MS in Computer Science, Computer Engineering, or relevant field (or equivalent experience).
  • 5+ years of professional experience in infrastructure, distributed systems, or platform engineering.
  • Hands-on experience with Bazel build/test automation frameworks.
  • Proficiency in C++, Python, and Bash coding skills.
  • Strong background in Linux systems, distributed systems, and infrastructure engineering.
  • Knowledge of cloud and on-prem environments, i.e. Kubernetes, Docker, VM infrastructure.
  • Strong debugging, problem-solving, and communication skills to work across internal and vendor teams.
  • Proven comfort leveraging AI-based development tools, such as Claude Code and Cursor.
  • Problem-solving mindset: capable of debugging across the stack (infra, build system, workloads).

Nice To Haves

  • Prior experience with coverage frameworks (lCOV, Gcov, VectorCAST) and delivering quality metrics in compliance-heavy environments.
  • Hands-on experience with static analysis tooling like Coverity, and embedding it into developer workflows.
  • Background in safety-critical domains like automotive, with audit-driven workflows.
  • Experience in requirements management tools (e.g, Jama) or traceability workflows.
  • Strong experience in large-scale distributed systems and using AI to accelerate debug and integration workflows.

Responsibilities

  • Optimize and operationalize Bazel-based build/test pipelines, integrating with CI/CD frameworks (e.g. Jenkins, GitLab).
  • Enable developers with tools, wrappers, and automation using Go, Python, and Bazel that improve correctness, prevent regressions, and enforce quality gates before code is merged.
  • Provide mechanisms for automated analysis, triage, and reporting that helps developers and customers act on results quickly.
  • Deploy and operationalize vendor-provided platforms in our cloud-based service platform, starting with test environments to validate dependencies, workflows, and performance.
  • Provide visualization and reporting capabilities to surface validation results, coverage metrics, and actionable insights for developers and collaborators.
  • Communicate proactively with stakeholders, ensuring no issues are left unattended and infrastructure evolves alongside developer needs.
  • Partner closely with internal teams and external vendors to solve issues, refine SLAs, and continuously improve operational reliability and scalability.

Benefits

  • With highly competitive salaries and a comprehensive benefits package, NVIDIA is widely considered to be one of the technology world’s most desirable employers.
  • You will also be eligible for equity and benefits.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service