As part of the Network Software and Services for AI (nssAI) team at xAI, you’ll build cutting-edge software, services, and frameworks to empower our Network Development Engineers. Working hands-on, you’ll tackle all facets of network management (metric collection, configuration, zero-touch provisioning, monitoring, and auto-remediation), driving automation-first solutions for xAI’s production and ancillary networks. Expect to develop extensible tools, streamline complex processes, and ensure rock-solid reliability to support xAI’s mission of accelerating human scientific discovery through AI. Focus: Building software and tools with extensive metrics coverage for some of the world’s largest GPU supercomputing network fabrics, which are used for AI training and for serving customer inference queries; implementing infrastructure-as-code (IaC) best practices; enhancing deployment pipelines; and ensuring robust, secure service delivery across our production environments.
Job Type
Full-time
Career Level
Senior
Education Level
No Education Listed