We are seeking an experienced Senior Systems Engineer with US Government Top Secret/SCI security clearance with Polygraph to support a small standalone system dedicated to high-performance computing (HPC) and artificial intelligence (AI) workloads. This role demands a blend of operational expertise and strategic technical vision, focusing on the management and optimization of our standalone HPC/AI system. The ideal candidate will manage the technical operation of our infrastructure, develop standardized procedures for hardware, network, and software management across the system, and expertly oversee cluster management (including provisioning, optimization, and monitoring of clustered resources for HPC/AI workloads, such as NVIDIA BCM). What will you do? This position requires broad expertise in HPC/AI system administration, with a focus on: Refining infrastructure management frameworks Traditional infrastructure management (hardware, networking, directory services) Modern HPC/AI support (Linux/Ubuntu, Proxmox, NVIDIA BCM, WEKA storage) Designing scalable, secure, and highly available system architectures
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level