Senior Software Engineer - NVIDIA Warp

NVIDIA, Santa Clara, CA

About The Position

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing, an era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent. As an NVIDIAN, you'll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.

Are you passionate about robotics and ready to expand what embedded systems can do? At NVIDIA, we're developing the future of AI and robotics. Our team is advancing GPU-accelerated simulation and robotics algorithms with NVIDIA Warp. Join us and collaborate with an experienced group committed to pushing the limits in robotics.

Requirements

  • B.Sc., M.Sc., Ph.D., or equivalent experience in Computer Science, Computer Engineering, Robotics, Applied Math, Physics, or a related field.
  • 8+ years of software engineering experience in C++ and Python, and comfort working across build systems and deployment tooling.
  • Experience shipping software to embedded or edge devices, ideally in robotics, autonomy, or real-time systems.
  • Practical understanding of Linux-based deployment workflows including packaging, dependencies, drivers, and debugging in constrained environments.
  • Ability to reason about GPU performance and memory behavior, and to diagnose bottlenecks using profiling and system tools.
  • Strong collaboration and communication skills, with a bias toward execution and unblocking users.

Responsibilities

  • Own and improve the end-to-end path for deploying Warp-based robotics and simulation components onto embedded platforms such as Jetson.
  • Build and maintain reproducible deployment workflows including cross-compilation, CI, packaging, and containerized delivery for embedded robotics targets.
  • Optimize on-device performance under real constraints including latency, throughput, memory footprint, thermals, and power.
  • Debug complex issues across the stack spanning Python, C++, CUDA, drivers, and embedded Linux, including hard-to-reproduce device-specific failures.
  • Integrate Warp components into robotics applications and frameworks, including ROS 2 and Isaac-based stacks, and work with partner teams to unblock adoption.
  • Develop system-level testing, validation, and performance regression infrastructure for embedded targets.
  • Collaborate with compiler, runtime, and kernel engineers to improve portability and performance across GPU architectures and embedded configurations.