NVIDIA’s GPU Workload Efficiency (GWE) team is looking for a skilled Senior Engineer to enhance training and inference performance. We are developing methods to improve the efficiency of AI workloads on NVIDIA GPUs. This position entails collaborating on GPU architecture, deep learning frameworks, and large-scale applications to optimize performance. Come aboard and be part of a team that is leading the evolution of AI computing!

What you’ll be doing:
Evaluating, explaining, and improving deep learning workloads for both training and inference, contributing to advancements in throughput, latency, and efficiency across NVIDIA GPU platforms.
Collaborating across NVIDIA with researchers, engineers, and hardware specialists to identify bottlenecks and achieve performance improvements.
Developing production-quality software across the deep learning platform stack, from frameworks to deployment.
Building automation and diagnostics that enable reproducible, scalable, and backend-agnostic performance improvements.
Job Type
Full-time
Career Level
Senior
Number of Employees
5,001-10,000 employees