Senior Generative AI Research Engineer

NVIDIA | Santa Clara, CA
70d | $224,000 - $356,500

About The Position

At NVIDIA, we're not just building the future; we're generating it. Our Cosmos generative AI engineering team is pushing the boundaries of what's possible across multimodal learning, video generation, synthetic data, intelligent simulation, and agentic systems. We are looking for exceptionally driven engineers and applied scientists with deep experience in generative modeling to help define the next era of AI computing.

Requirements

  • Minimum of 8 years of industry experience, or 5+ years of research/postdoctoral experience, in building and deploying generative AI systems.
  • Proficiency in PyTorch, JAX, or other deep learning frameworks.
  • Expertise in one or more of: LLMs, coding agents, diffusion models, autoregressive models, VAE/GAN architectures, retrieval-augmented generation, neural rendering, or multi-agent systems.
  • Intimate familiarity with all variants of the attention mechanism in transformer architectures.
  • Hands-on experience with large-scale training (e.g., ZeRO, DDP, FSDP, TP, CP) and data processing (e.g., Ray, Spark).
  • Production-quality software engineering skills in Python.
  • MS or PhD, or equivalent experience, in Computer Science, Machine Learning, Applied Math, Physics, or a related field.
  • 12+ years of relevant software development experience.

Nice To Haves

  • Familiarity with high-performance computing and GPU acceleration.
  • Contributions to influential open-source libraries, or publications at influential conferences (NeurIPS, ICML, CVPR, ICLR).
  • Experience working with multimodal data (e.g., vision-language, VLA, audio).
  • Prior work with NVIDIA GPU-based compute clusters or simulation environments.

Responsibilities

  • Design and post-train foundation models (LLMs, VLMs, VLAs, and DiTs) for real-world applications.
  • Contribute to highly collaborative development of large-scale training infrastructure, high-efficiency inference pipelines, and scalable data pipelines.
  • Work with teams in research, software, and product to bring world models from idea to deployment.
  • Collaborate on open-source and internal projects, author technical papers or patents, and mentor junior engineers.
  • Prototype and iterate rapidly on experiments across cutting-edge AI domains, including agentic systems, reinforcement learning, reasoning, and video generation.
  • Design and implement model distillation algorithms for size reduction and diffusion step optimization.
  • Profile and benchmark training and inference pipelines to achieve production-ready performance requirements.

Benefits

  • Equity and benefits.

What This Job Offers

  • Job Type: Full-time
  • Career Level: Senior
  • Industry: Computer and Electronic Product Manufacturing
  • Education Level: Master's degree
