Senior Software Architect, AI and HPC

NVIDIA • Posted 8 days ago

Full-time • Senior

US, CA

5,001-10,000 employees

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing, an era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.

The software architecture group at NVIDIA has openings for software architects in the field of AI and high-performance networking and system software. We research, develop, and deploy solutions in networking hardware, programming environments, and system software to make current and future high-end computer systems more performant, scalable, and usable.

Create proofs-of-concept to evaluate and motivate extensions in AI frameworks (PyTorch/NeMo), HPC programming models (MPI, OpenSHMEM, PGAS), new runtime designs, and new network hardware features.
Research, design, and implement features for AI and HPC communication middleware (NCCL, Open MPI, UCX, UCC, NVSHMEM) and deep learning frameworks such as TensorFlow and PyTorch.
Review, design, and implement compiler features that support the NVIDIA networking ecosystem.
Research, design, and develop hardware features relevant to scientific, deep learning, and data-intensive workloads.

A Ph.D. or Master's degree in computer science, computer engineering, or a closely related field, or equivalent experience
5+ years of experience in parallel programming models and/or network architecture
Background in algorithm design, system programming, and computer architecture
Strong programming and software development skills
Ability and flexibility to work and communicate effectively in a multi-national, multi-time-zone corporate environment
Deep understanding of technology and passion for what you do
Strong collaborative and interpersonal skills, specifically a proven ability to effectively guide and influence within a dynamic matrix environment

Background in designing communication middleware for high-performance computing systems, including InfiniBand, DPUs, Ethernet, and shared memory
Experience developing and implementing compiler features and optimizations, particularly for Clang/LLVM and NVIDIA compilers
Experience implementing communication libraries and programming models, particularly MPI, OpenSHMEM, NCCL, NVSHMEM, UCX, UCC, or PGAS
Background in CUDA programming and NVIDIA GPUs
Experience with programming models for emerging architectures, including hierarchical heterogeneous memory systems and accelerators

You will also be eligible for equity and benefits.
