NVIDIA is looking for outstanding software engineers to help us expand our enterprise GPU management and monitoring tools. In this role, you will work closely with the broader NVIDIA team to design and build cloud-native management agents, Kubernetes integrations, and end-to-end solutions that integrate GPUs with the rest of the datacenter software management ecosystem. As the role of GPUs expands, we are focused on supporting NVIDIA products across HPC, cloud, and enterprise environments, on both bare-metal and virtualized platforms.

Your contributions will span many aspects of GPU system integration, including telemetry and metrics, health checks, diagnostics, configuration, and system management. These tools serve both passive background monitoring and active online management, with a core emphasis on operational transparency and seamless integration in customer environments. Your code will support everything from single-node developer systems to large clusters with thousands of nodes.

To succeed, you must have a strong Linux background, familiarity with modern cloud-native systems, and a proven work ethic. You will be expected to jump in quickly and make valuable contributions from day one. This is a dynamic work environment with many exciting opportunities. NVIDIA GPUs are central to many of today's enterprise, cloud, and datacenter trends. Come join us as we craft the future of accelerated computing and AI.
Job Type: Full-time
Career Level: Mid Level
Number of Employees: 5,001-10,000