About The Position

We are looking for a top-tier Solution Architect to join the growing NVIDIA AI Infrastructure team. NVIDIA is the world leader in computer graphics, artificial intelligence, and accelerated computing. We're in search of creative minds like you to help customers design, architect and implement accelerated computing data center solutions that will power workflows including AI inference at scale and physical AI through digital simulation. NVIDIA solutions are built around industry leading AI software tools, but these tools are only as good as the infrastructure that enable the workloads. On this team, you will do full stack design including hardware architecture, workload orchestration and application performance tuning. At NVIDIA, you will be immersed in a diverse, encouraging environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.

Requirements

  • Bachelor's degree or equivalent experience in Engineering or Computer Science
  • 15+ years of meaningful work experience, ideally in an IT infrastructure or related field of expertise
  • An outstanding passion for groundbreaking IT infrastructures that optimize AI workloads
  • Expertise with infrastructure management including Linux, Kubernetes, Ethernet networking, and tools built for cloud environments
  • Strong experience with Linux system environments, using Linux as a core operating system for IT workload delivery
  • Confident with Python programming, specifically interfacing with IT infrastructure through API, SDKs and libraries
  • Mastery of how software and hardware work together to optimize applications
  • Confident experience with AI workloads, with an emphasis on inference workloads
  • Ability to work independently with a remote team with minimal direction
  • Outstanding communication skills, strong interpersonal and be an excellent teammate!

Nice To Haves

  • Experience with on-prem infrastructure architecture and large-scale cloud deployments
  • Experience with cloud native tooling including Terraform, Kubernetes, Helm
  • Background in building large scale infrastructure that deliver workloads via containers
  • Experience optimizing and troubleshooting performance of compute infrastructure
  • Deploying AI agents to increase productivity and accelerate content delivery
  • Critical thinking capabilities that leverage fundamentals to deduce solutions to unforeseen problems

Responsibilities

  • Help customers with their AI factory journey, including workflow pipelines and performance optimization
  • Focus data center implementations for inference use cases, including distributed, disaggregated and scaled out workflows
  • Scope physical AI journeys on Omniverse, including synthetic data generation, data aggregation, application development and simulation pipelines
  • Lead technical sales activities for AI factories with focus on hybrid deployments between cloud and on-prem
  • Deliver hybrid cloud architectures for data pipelines, storage, security and user streaming connectivity
  • Providing expertise in infrastructure workflows, including hardware, coordination of workload processes, and application tuning
  • Understand different solutions trade-offs and propose enterprise customers the best architecture and technical execution
  • Work directly with key customers to understand workflows and share feedback with internal product and engineering teams

Benefits

  • equity
  • benefits

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Senior

Number of Employees

5,001-10,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service