We are seeking a Senior System Software Engineer to own and advance the AI-Perf analysis, NVIDIA’s flagship framework for benchmarking, experimentation, and analysis of LLMs, Generative AI, and deep learning inference workloads. In this role, you’ll combine systems research, distributed systems engineering, and applied AI, enabling reproducible performance evaluation, influencing internal platforms, and providing tooling that empowers researchers and engineers globally. What you’ll be doing: Lead the design, development, and roadmap of AI-Perf, defining benchmarking methodologies, performance metrics, and reproducible experimental workflows. Build scalable and high-performance features to measure latency, throughput, and efficiency across AI models and distributed systems. Partner with AI researchers, platform teams, and engineers to translate experimental challenges into robust, user-friendly performance tooling. Integrate AI-Perf with the Dynamo Inference Stack, other NVIDIA inference stacks, and open-source inference frameworks, delivering end-to-end performance insights for researchers and production users.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Number of Employees
5,001-10,000 employees