Systems Software Engineer - GPU

NVIDIA, Redmond, WA

About The Position

NVIDIA is searching for a highly motivated, creative engineer to join the GPU Software team. As a GPU system software engineer, you will work with a team of hardworking software and hardware engineers across a wide variety of technologies. You will design key aspects of our production GPU kernel drivers and embedded SW, focusing on GPU and platform-level resiliency. You should demonstrate the ability to excel in an environment with sophisticated system-level interactions.

NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're creative, passionate, and self-motivated, we want to hear from you! NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing, and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 124,000 USD - 195,500 USD for Level 2, and 152,000 USD - 241,500 USD for Level 3. You will also be eligible for equity and benefits. Applications for this job will be accepted at least until February 2, 2026. This posting is for an existing vacancy. NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and is proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.

NVIDIA is the world leader in accelerated computing. NVIDIA pioneered accelerated computing to tackle challenges no one else can solve. Our work in AI and digital twins is transforming the world's largest industries and profoundly impacting society.

Requirements

  • BS or MS degree in Computer Engineering, Computer Science, or related degree (or equivalent experience)
  • Background in solving problems that apply to large complex systems deployed at scale
  • Strong C/C++ programming skills and demonstrated initiative in pursuing independent coding projects
  • Familiarity with computer system architecture, microprocessor, and microcontroller fundamentals (caches, buses, memory controllers, DMA, etc.)
  • Strong operating systems fundamentals with kernel experience on Linux or Windows systems
  • 2+ years of meaningful software development experience

Nice To Haves

  • Background and strength in sophisticated system-level debugging
  • Deep understanding of memory management and virtualization concepts
  • Familiarity with kernel-level security concepts
  • Experience with embedded system SW concepts, e.g., RTOS and overlay programming models

Responsibilities

  • Design and implement changes in the NVIDIA SW stack to improve system-level resiliency and reliability at datacenter scale with thousands of GPUs.
  • Focus on adding features that bolster system-level availability, early fault detection, and faster recovery.
  • Follow the devices all the way through the development process to datacenter systems, customer desktops, notebooks, workstations, and gaming console products that are used throughout the world.
  • Be heavily involved in architecture definition and the early modeling and simulation required to create our groundbreaking products.
  • Collaborate and communicate effectively with teams from all around the globe.

Benefits

  • equity
  • benefits