In this role, you will be a contributing member of the OEM AI Factory SA team. Our work encompasses MEP (Mechanical Electrical Plumbing), Ethernet and Infiniband networking, DevOps, HPC/AI workloads, Cluster Administration and Site Reliability Engineering. You will acquire insight into various facets of AI Factories deployments. Applicants should be familiar with Linux system administration, Python, and networking concepts. Solid understanding of Slurm and data sciences is a plus. Our Team is responsible for OEM AI factory build engagements - which means that we work with our OEM partners (Dell, HPE, Lenovo and others) to use NVIDIA solutions integrated in their platforms. NVIDIA certified servers include GB200/300 NVL72, along with our software stack that assists with the deployment, configuring, validating and monitoring for the AI Factories of the future.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level