Software Engineer - Networking Software and Services

xAI | Palo Alto, CA
$150,000 - $250,000 | Hybrid

About The Position

As part of the Network Software and Services for AI (nssAI) team at xAI, you’ll build cutting-edge software, services, and frameworks to empower our Network Development Engineers. Working hands-on, you’ll tackle all facets of network management (metric collection, configuration, zero-touch provisioning, monitoring, and auto-remediation), driving automation-first solutions for xAI’s production and ancillary networks. Expect to develop extensible tools, streamline complex processes, and ensure rock-solid reliability to support xAI’s mission of accelerating human scientific discovery through AI.

Focus: Building software and tools with extensive metrics coverage for some of the world’s largest GPU supercomputing network fabrics, which are used for AI training and for serving customer inference queries; implementing infrastructure-as-code (IaC) best practices; enhancing deployment pipelines; and ensuring robust, secure service delivery across our production environments.

Requirements

  • Python
  • Go
  • TCP/IP
  • BGP
  • RDMA
  • Deep experience collaborating daily with network engineers, backed by extensive knowledge of physical and logical network topologies and of network protocols.
  • Expert knowledge of, and a proven track record in, designing scalable and reliable software from the ground up that can build and orchestrate tens of thousands of network devices at high speed.
  • Ability to thrive in ambiguity, creating metrics that help prioritize both the team’s focus and your own.

Responsibilities

  • Build cutting-edge software, services, and frameworks to empower Network Development Engineers.
  • Tackle all facets of network management: metric collection, configuration, zero-touch provisioning, monitoring, and auto-remediation.
  • Drive automation-first solutions for xAI’s production and ancillary networks.
  • Develop extensible tools.
  • Streamline complex processes.
  • Ensure rock-solid reliability to support xAI’s mission.
  • Build software and tools with extensive metrics coverage for large GPU supercomputing network fabrics.
  • Implement IaC best practices.
  • Enhance deployment pipelines.
  • Ensure robust, secure service delivery across production environments.

Benefits

  • Equity
  • Comprehensive medical coverage
  • Vision coverage
  • Dental coverage
  • Access to a 401(k) retirement plan
  • Short-term disability insurance
  • Long-term disability insurance
  • Life insurance
  • Various other discounts and perks