Senior HPC Administrator (IMC - 003)

SageCor SolutionsCollege Park, MD

About The Position

Serving Maryland and the Greater Washington D.C. area, SageCor Solutions (SageCor) is a growing company bringing complete engineering services and true full lifecycle System Engineering services to areas requiring (or desiring) nationally-recognized expertise in high performance computing, large data analytics and cutting edge information technologies. Active TS/SCI w/ Polygraph required.

Requirements

  • Active Top Secret/SCI clearance with polygraph
  • Experience administering Linux-based servers and HPC clusters, including job schedulers (e.g., Slurm, LSF, PBS)
  • Experience configuring and managing Virtual Private Network (VPN) clients and servers
  • Scripting/programming skills (C and Python)
  • Knowledge of System automation tools (e.g., Ansible)
  • Knowledge of System provisioning tools (e.g., Warewolf)
  • Knowledge of Distributed storage systems (e.g., Lustre, BeeGFS)
  • Knowledge of Containerization (e.g., Docker, Apptainer)
  • Knowledge of Installing, maintaining and using infrastructure and performance monitoring and optimization tools (e.g., Grafana, Prometheus)
  • Knowledge of Setting up and executing benchmarks in an HPC environment and analyzing their results systematically

Nice To Haves

  • Preferably meets DoD 8140.01 or DoD 8570.01-M training and certification requirements

Responsibilities

  • Configure and manage Linux and Windows (or other applicable) operating systems and installs/loads operating system software, troubleshoot, maintain integrity of and configure network components, along with implementing operating systems enhancements to improve security, reliability, and performance
  • Administer, monitor, and maintain HPC systems, including compute nodes, storage, networking, and software stacks
  • Provide support to IT systems including day-to-day operations, monitoring and problem resolution for all of the client/server/storage/network devices, mobile devices, etc.
  • Implement and maintain automation tools for system provisioning, configuration management, and monitoring.
  • Provide support for implementation, troubleshooting and maintenance of IT systems
  • Manage the daily activities of configuration and operation of IT systems
  • Provide assistance to users in accessing and using IT systems
  • Optimize system operations and resource utilization, and perform system capacity analysis and planning
  • Provide in-depth experience in trouble-shooting IT systems
  • Analyze and resolve complex problems associated with server hardware, applications and software integration
  • Contribute to performance benchmarking, system tuning, and capacity planning
  • Support researchers by providing technical expertise and resolving IT-related roadblocks or issues
  • Document system administration procedures and contribute to knowledge-sharing initiatives

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Senior

Education Level

No Education Listed

Number of Employees

1-10 employees

© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service