We are looking for an experienced HPC Storage Engineer to design, implement, and optimize the storage and data movement infrastructure that underpins our high-performance computing (HPC) environment. This role focuses on distributed and parallel filesystems, storage systems, and large-scale data movement, ensuring reliable, high-throughput access to data for compute-intensive workloads. You will work closely with HPC platform engineers, compute and networking teams, and application users to deliver scalable, performant, and resilient storage solutions that tightly integrate the storage layer with compute nodes. In this role, you will: Design, deploy, and operate HPC storage systems and parallel/distributed filesystems (e.g., Lustre, GPFS/IBM Spectrum Scale, BeeGFS, Ceph). Own data movement workflows across environments, including data ingest, replication, tiering, and archiving. Optimize filesystem and storage performance for large-scale parallel workloads. Design and tune load-balancing strategies across storage targets, metadata services, and data movement pipelines to ensure even utilization, high throughput, and predictable performance at scale. Troubleshoot storage, I/O, and data movement issues across HPC compute clusters. Develop and maintain automation for storage provisioning, monitoring, and lifecycle management. Partner with compute and networking teams to ensure end-to-end performance and reliability. Advise users and application teams on best practices for I/O patterns, data layout, and performance tuning. Evaluate and integrate new storage technologies and architectures as requirements evolve.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Education Level
No Education Listed
Number of Employees
1,001-5,000 employees