About The Position

The NVIDIA Experience (NVEX) Solutions Engineering team is seeking a senior Computer or Software Engineer to become an expert in advanced network technology used in AI clusters. This team of software engineers acts as a liaison between customer support and R&D, focusing on resolving complex issues from the field and providing top-tier support for InfiniBand, NVLink, and Spectrum-X network systems that connect GPUs and AI compute infrastructure. The role demands a strong software engineering background for in-depth code and logic analysis, along with proven experience in production network operations and close collaboration with customer engineering teams and internal R&D to diagnose and resolve critical problems. The successful candidate must be adept at interacting and communicating technical information to diverse groups, including R&D engineers, support teams, account teams, and executive management. Daily interaction with both internal and external customers necessitates excellent interpersonal and communication skills. Additionally, team members will have opportunities to work within R&D groups on feature development and bug fixes to directly improve the products they support.

Requirements

  • Software development experience in the networking industry either for a network hardware manufacturer or software integrator
  • Experience working directly with customers and support groups to triage and resolve critical in-field, in-production networking problems
  • Bachelor's degree in software engineering, computer engineering or related (or equivalent experience)
  • 5+ years of experience developing software (C, C++, Python, or Go)
  • 5+ years of experience directly supporting end-customers, partners, or integrators for network equipment and infrastructures
  • Experience analyzing, developing, and debugging at least 2 of the following: Linux NIC drivers, switch ASICs and SDKs, embedded network device firmware, Linux based network equipment (routers, switches, gateways, etc), network operating systems
  • Expert knowledge of Ethernet and IP routing down to the protocol’s byte level
  • Ability to analyze and learn evolving end-to-end systems flow, quickly learn how unfamiliar software layers are integrated, and be able to find logic bugs within an increasingly complex environment
  • Passion and motivation to isolate, root cause, and resolve high impact issues with executive visibility
  • Professional-level communication skills, including adjusting communication to the technical level of the audience, and staying calm and focused in negative situations

Responsibilities

  • Assist various network and AI cluster support teams in reproducing, resolving, and root causing sophisticated customer issues
  • Work with NVIDIA R&D teams to rapidly develop bug fixes, workarounds, and solutions for customers using NVIDIA’s network technologies
  • Work directly with customer engineers on live triage of critical and high-profile issues
  • Become an authority in NVIDIA Spectrum-X networking used in AI clusters
  • Develop support and analysis tools to help analyze and root cause field issues
  • Daily use of ground breaking AI tools for software development, log and trace analysis, and source code debugging
  • Spend a portion of time working directly within an R&D team developing core software
  • Occasional work on weekends or holidays to support critical customers

Benefits

  • equity
  • benefits
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service