NVIDIA is seeking a Senior Machine Learning Software Engineer to discover and develop new low-precision and sparsity recipes for pretraining. We are a team committed to developing next-generation software that makes use of novel hardware features on current GPUs, and we also provide guidance on the design of next-generation GPU features. The job scope spans recipe design for all phases of the LLM life cycle: pretraining, post-training, and generation. Making these recipes generic and accurate is critical for adoption. Your work will be a component of our software productization story in libraries such as Megatron-LM, Transformer Engine, cuDNN, and cuBLAS.
Job Type: Full-time
Career Level: Senior
Industry: Computer and Electronic Product Manufacturing
Education Level: Ph.D. or professional degree