HPC Storage Systems Team Lead

Argonne National LaboratoryLemont, IL
5d$116,938 - $182,424Onsite

About The Position

Argonne is a multidisciplinary science and engineering research center, where “dream teams” of world-class researchers work alongside experts from industry, academia and other government laboratories to address vital national challenges in clean energy, environment, technology and national security. The HPC Storage Systems Team Lead provides team leadership and overall technical planning for projects related to the HPC Storage team of the Argonne Leadership Computing Facility (ALCF). This role involves: The technical operation, administration, and maintenance of hardware and software components of disk storage and file systems for the facility’s supercomputing resources. The team lead will also lead efforts in the design and planning for future machine upgrades and recommend improved approaches for providing disk storage and file system services. Provide rigorous oversight of design, testing, and deployment for storage solutions while mentoring team members and stakeholders on emerging storage technologies. Work mostly on-site in Lemont, Illinois

Requirements

  • PTL1: Master’s degree and 6+ years of experience, or Bachelor’s degree and 10+ years of experience in computer science or engineering preferred.
  • To perform the essential functions of this position successful applicants must provide proof of U.S. citizenship, which is required to comply with federal regulations and contract.
  • Ability to lead a team of individual contributors to integrate storage solutions with compute clusters using advanced networking (InfiniBand, Slingshot, Ultra Ethernet).
  • Comprehensive expertise in storage performance through strategic tuning of configurations, including NVMe-oF, metadata management, and tiering strategies.
  • Integrate parallel file systems (e.g., Lustre, Spectrum Scale) and distributed storage solutions, ensuring seamless compatibility with existing hardware and software ecosystems.
  • Demonstrated leadership experience in designing, planning, and implementing storage systems at multi-PB scale, including state-of-the-art parallel file systems.
  • Deep technical expertise in Linux administration, storage hardware (NVMe SSDs, high-capacity disk arrays), storage protocols (e.g., NFS, Lustre, S3, HPSS), and vendor solutions (e.g., VAST, WEKA, DDN, Spectra Logic).
  • In-depth understanding of high-speed networking technologies and integration challenges within HPC clusters.
  • Advanced proficiency of scripting languages (Python, Bash) and system automation to streamline storage operations and troubleshooting.
  • Ability to develop goals and objectives focused on the success of the HPC storage team.
  • Excellent communication skills and the ability to engage with leaders and other managers as part of the decision-making process.
  • Experience overseeing large-scale projects, with significant organizational impact and accountability for high-stakes deliverables.
  • Ability to model Argonne’s core values of impact, safety, respect, integrity and teamwork

Responsibilities

  • Technical operation, administration, and maintenance of hardware and software components of disk storage and file systems for the facility’s supercomputing resources.
  • Lead efforts in the design and planning for future machine upgrades and recommend improved approaches for providing disk storage and file system services.
  • Provide rigorous oversight of design, testing, and deployment for storage solutions while mentoring team members and stakeholders on emerging storage technologies.

Benefits

  • comprehensive benefits are part of the total rewards package.
  • Click here to view Argonne employee benefits!
  • As an equal employment opportunity employer, and in accordance with our core values of impact, safety, respect, integrity and teamwork, Argonne National Laboratory is committed to a safe and welcoming workplace that fosters collaborative scientific discovery and innovation.
  • Argonne encourages everyone to apply for employment.
  • Argonne is committed to nondiscrimination and considers all qualified applicants for employment without regard to any characteristic protected by law.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Number of Employees

1,001-5,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service