Research Scientist, Efficient Deep Learning - New College Grad 2026

NVIDIA • US, CA


About The Position

NVIDIA is searching for an outstanding researcher in efficient deep learning to join its deep learning efficiency research team. We are passionate about research that pushes boundaries and also has impact in the real world. We are particularly excited about methods for post-training model optimization (pruning, quantization, NAS), efficient architecture design, adaptive/dynamic inference, resource-efficient training and finetuning, and related topics. You will work within an amazing and collaborative research team that consistently publishes at the top venues in computer vision and machine learning. Our existing expertise includes computer vision, deep learning, and generative models, among other areas. Your contributions have the chance to create real impact on our products.

Requirements

Currently completing or recently completed a Ph.D. in Computer Science/Engineering, Electrical Engineering, or a related field, or equivalent research experience.
Excellent knowledge of theory and practice of computer vision methods, as well as deep learning.
Experience with large language models and large vision-language models is required.
Excellent programming skills in Python and PyTorch; C++ and parallel programming (e.g., CUDA) are a plus.
Hands-on experience with large-scale model training, including data preparation and model parallelization (tensor and pipeline), is required.
Outstanding research track record.
Excellent communication skills.