Senior Machine Learning Engineer, Recipe Pathfinding

Nvidia
Redmond, WA
105d
$184,000 - $356,500

About The Position

NVIDIA is seeking a Senior Machine Learning Software Engineer to discover and develop new low-precision and sparsity recipes in the pre-training setting. We are a team committed to developing next-generation software that makes use of novel hardware features on current GPUs. We also provide guidance for the design of next-generation GPU features. The job scope spans recipe design for all phases of the LLM life cycle: pre-training, post-training, and generation. Making these recipes generic and accurate is critical for adoption. Your work will be a component of our software productization story in libraries such as Megatron-LM, Transformer Engine, cuDNN, and cuBLAS.

Requirements

  • PhD or M.S. degree (or equivalent experience) in Computer Science or a related field
  • 5+ years of relevant software engineering experience
  • Proficient in Python
  • Experience with PyTorch or similar framework
  • Solid foundation in LLM pre-training, post-training, or generation
  • Proficient in the math of machine learning
  • Strong written and oral communication skills

Nice To Haves

  • Proficiency in precision and numerics for ML
  • Familiarity with FP8 and MX formats for training
  • Strong programming skills and ability to debug ML systems

Responsibilities

  • Keep abreast of quantized LLM training research
  • Build robust and reproducible training recipes
  • Collaborate closely with hardware, software, and research teams to assess and adopt deep learning algorithmic advancements in quantization
  • Work with production software teams to realize recipes in production workflows

Benefits

  • Equity
  • Comprehensive health benefits
  • 401k plan
  • Paid time off
  • Flexible working hours

What This Job Offers

Job Type

Full-time

Career Level

Senior

Industry

Computer and Electronic Product Manufacturing

Education Level

Ph.D. or professional degree
