NVIDIA is looking for an outstanding, passionate, and dedicated Senior AI Infrastructure Engineer to join our DGX Cloud group. In this engineering role, you will design, build, and maintain large-scale production systems with high efficiency and availability, using a combination of software and systems engineering practices. The role demands knowledge across different systems, networking, coding, databases, capacity management, continuous delivery and deployment, and open-source cloud-enabling technologies such as Kubernetes and OpenStack.

DGX Cloud SREs at NVIDIA ensure that our GPU cloud services deliver maximum reliability and uptime. They carefully prepare and plan changes to the system, and they manage capacity and performance.

NVIDIA values diversity, curiosity, problem-solving, and openness. Our team includes people with varied backgrounds and perspectives. We encourage collaboration, big thinking, and blameless risk-taking. We promote self-direction on meaningful projects and provide support and mentorship to foster learning and growth.
Job Type: Full-time
Career Level: Senior