About The Position

NVIDIA has become the platform upon which every new AI-powered application is built. From healthcare research applications to autonomous vehicles, or voice-recognition systems, the need for advanced perception and cognitive capabilities is exploding, and NVIDIA is right in the center of this revolution. We are seeking a motivated Senior Systems Software Engineer to join our Autonomous Vehicle Infrastructure organization and participate in accelerating the validation and quality of our Autonomous Vehicle (AV) software stack, including coverage, analysis, and tooling development. This role blends infrastructure engineering with developer enablement — ensuring our build/test environments are reliable, scalable, and aligned with safety-critical standards. The ideal candidate is hands-on, adaptable, and eager to bridge infra with developer workflows in a fast-moving environment.

Requirements

  • BS/MS in Computer Science, Computer Engineering, or relevant field (or equivalent experience).
  • 5+ years of professional experience in infrastructure, distributed systems, or platform engineering.
  • Strong background in Linux systems, distributed systems, and infrastructure engineering.
  • Hands-on experience with Bazel build system and its integration into CI/CD pipelines.
  • Proficiency in C++, Python and Bash.
  • Experience with PostgreSQL and data handling at scale.
  • Knowledge of cloud and on-prem environments: Kubernetes, Docker, VM infrastructure.
  • Familiarity with logging, monitoring, and alerting stacks (Grafana, Prometheus, ELK stack).
  • Ability to collaborate across teams and communicate effectively with developers and stakeholders.
  • Problem-solving mindset: capable of debugging across the stack (infra, build system, workloads).

Nice To Haves

  • Prior experience with coverage frameworks (lcov, gcov, VectorCAST) and delivering quality metrics in compliance-heavy environments.
  • Hands-on experience with static analysis tooling like Coverity, and embedding it into developer workflows.
  • Background in safety-critical domains like automotive, with audit-driven workflows.
  • Experience in requirements management tools (Codebeamer/Jama) or traceability workflows.
  • Familiarity with AI-assisted tooling (LLMs, code assistants, automation bots) for accelerating infra and developer workflows.

Responsibilities

  • Design, deploy, and maintain distributed infrastructure to support AV software builds, simulation, and validation.
  • Operate and optimize Bazel-based build/test pipelines, integrating with CI/CD frameworks (e.g. GitLab, Jenkins).
  • Support large-scale data and service workflows with a focus on performance, scalability, and reliability.
  • Enable developers with tools, wrappers, and automation that improve correctness, prevent regressions, and enforce quality gates before code is merged.
  • Provide mechanisms for automated analysis, triage, and reporting that help developers and stakeholders act on results quickly.
  • Build dashboards and metrics for system health, workload quality, and resource utilization across compute and storage environments.
  • Communicate proactively with stakeholders, ensuring no issues are left unattended and infra evolves alongside developer needs.

Benefits

  • Highly competitive salaries
  • Comprehensive benefits package
  • Equity eligibility

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Senior

Industry

Computer and Electronic Product Manufacturing

Education Level

Bachelor's degree

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service