NVIDIA is the platform upon which every new AI-powered application is built. We are seeking a senior engineer to design and build factory infrastructure and automation for NVIDIA Inference Microservices (NIMs). The right person for this role brings technical drive and creativity to change the way NVIDIA optimizes and serves performant inferencing for every AI model in a heterogeneous cluster environments. Our NIM offerings are easy to use, highly performant and tested in all deployment scenarios, in the cloud, on customer’s self-hosted infrastructure and locally on all NVIDIA GPUs. You will apply your deep technical expertise to design an efficient, scalable and reliable automation factory infrastructure that will take AI models to become NIMs that are validated for best in class performance and accuracy. NVIDIA is building a new category of products, by intersecting our prowess in deep learning and computing, with industry-leading technologies. You will harness groundbreaking technologies, and build a highly efficient factory to power how NVIDIA builds and validates NIMs for inferencing all the way through deployment in heterogeneous hardware and software environments. You will influence and drive technical advances in NVIDIAs workflows and build the infrastructure that strives to accelerate the delivery of every AI model on NVIDIA's GPUs anywhere. We are looking for technical talent to design and build our factory capabilities, including the underlying infrastructure, pipelines, backends, Docker build, test harness, metrics, performance engineering, log ingestion, and more.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Number of Employees
5,001-10,000 employees