About The Position

NVIDIA’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern deep learning — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as “the AI computing company.” We're looking to grow our company and establish teams with the most thoughtful people in the world. We are looking for an excellent engineering manager to own and deliver an end to end manageability stack for Data Center Systems. We are seeking an experienced manager who is deeply technical, hands-on, and has a wide system view. You will manage a team of experts, design & build OpenBMC based manageability software stack for NVIDIA’s next generation Data Center Compute Systems. We want to grow our teams with the smartest people in the world. If you're creative and autonomous, we want to hear from you!

Requirements

  • BS, MS, or PhD in EE/CS or related field of education or equivalent experience.
  • 10+ overall years of relevant experience working on server firmware (BMC) and platform software development
  • 5+ years of experience in managing a software/firmware engineering team
  • Hands on experience with data center health management workflow.
  • Proven record of delivering server firmware for large data centers.
  • Strong knowledge of data center management, server architecture and server manageability in data centers.
  • Strong and demonstrable skill in C/C++ and Python.
  • Experience programming and debugging skills for server platforms.
  • Experience in SCM (e.g. Git, Perforce) and project management tools like Jira.
  • Possess excellent written and oral communication skills, good work ethics, high sense of team-work, love to produce quality work.
  • Self-starter who loves to find creative solutions to complicated problems

Nice To Haves

  • Hands on experience with BMC firmware/software stack for data center health management and server manageability.
  • Proven engineering managers driving large complex problem with 25+ engineers working

Responsibilities

  • Own and deliver OpenBMC based manageability stack for next generation Data Center Compute Systems.
  • Own firmware delivered to data centers in terms of quality, reliability and telemetry performance.
  • Manage and lead a distributed team of software engineers to deliver firmware stack with high quality.
  • Work with data center architects and cloud customers for correct requirements and scope implementation to ensure speed of light product development.
  • Work closely with cross functional teams to ensure scalable manageability architecture for all data centers products
  • Drive efficiency, reliability and optimization in firmware architecture from a data center view point.
  • Work closely with customers and internal teams to resolve issues at Speed of Light.

Benefits

  • You will also be eligible for equity and benefits

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Manager

Education Level

Ph.D. or professional degree

Number of Employees

5,001-10,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service