About The Position

NVIDIA is seeking a Senior Systems Software Engineer to build cloud-native platform software harnessing open-source container runtimes and Kubernetes. You will design and deliver capabilities that make GPU / DPU accelerated applications easier to develop, deploy, operate, and observe. We value strong programming skills, deep systems knowledge, distributed systems and DevOps experience, and the ability to turn ideas into high-quality, shippable software. NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars. NVIDIA is looking for great people like you to help us accelerate the next wave of artificial intelligence. NVIDIA is widely considered to be one of the world's most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're a creative, curious, and driven technical leader, we want to hear from you!

Requirements

  • BS/MS in Computer Science or related field, or equivalent experience.
  • 8+ years of relevant software engineering experience.
  • Expert-level proficiency in a systems language and strong understanding of data structures and algorithms.
  • Strong understanding of container orchestration systems (K8s).
  • Expert knowledge and experience working with Unix/Linux.
  • Automation experience with configuration management and infrastructure as code.
  • Experience setting up, maintaining, and automating CI/CD systems.
  • Background in cloud computing and distributed system design and development.
  • Understanding performance, security, and reliability in distributed environments.

Nice To Haves

  • Broad experience with Golang and Kubernetes controllers.
  • Strong background with GitLab, Jenkins, and/or other CI/CD systems.
  • Significant hands-on production experience with containers and Kubernetes.

Responsibilities

  • Design and implement platform software enabling GPUs as first-class resources in Kubernetes.
  • Build features and integrations that streamline the lifecycle of accelerated Kubernetes clusters (build, scale, and monitor).
  • Collaborate across NVIDIA to ensure seamless integration throughout the software and hardware stack.
  • Automate and optimize build, test, integration, and release pipelines for cloud-native services.
  • Diagnose and improve performance, reliability, and security across complex distributed systems.
  • Manage multiple priorities while maintaining high code quality and clear documentation.

Benefits

  • You will also be eligible for equity and benefits .

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Number of Employees

5,001-10,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service