Linux (RHEL) Operating Systems Lead

TekSynapRockville, MD
Onsite

About The Position

The Linux (RHEL) Operating System Lead is responsible for the administration and optimization of Red Hat Enterprise Linux (RHEL) systems supporting the NRC’s High Performance Computing System (HPCS), including the Computational Fluid Dynamics (CFD) cluster and cloud-based environments.

Requirements

  • 7+ years of experience in Linux system administration (RHEL preferred)
  • Linux system performance tuning
  • Shell scripting (Bash, Python)
  • System monitoring tools
  • Experience supporting HPC environments or compute clusters
  • Virtualization (VMware)
  • Cloud environments (AWS GovCloud, Azure)
  • Knowledge of FISMA, NIST, and federal security standards
  • Strong troubleshooting and analytical skills
  • Bachelor’s degree in IT, Computer Science, or related field (or equivalent experience)
  • Must be able to obtain and maintain an NRC security clearance (Public Trust or higher)
  • U.S. Citizen

Nice To Haves

  • Red Hat certifications (RHCSA, RHCE)
  • HPC schedulers (e.g., Slurm, PBS)
  • Parallel computing environments
  • Experience with scientific/engineering tools (e.g., ANSYS, CFD applications)
  • Automation tools (Ansible, Puppet)

Responsibilities

  • Administer and maintain RHEL-based systems in standalone and cloud environments
  • Support HPC/CFD Linux cluster operations, including performance tuning and job scheduling
  • Install, configure, and maintain scientific and engineering applications (e.g., ANSYS, Nek5000)
  • Monitor system health, performance, and availability
  • Troubleshoot system failures and restore services quickly
  • Implement security patches, updates, and vulnerability remediation
  • Support continuous monitoring and FISMA compliance activities
  • Manage user accounts, permissions, and access controls
  • Configure networking, storage, and cluster components
  • Maintain system documentation, scripts, and operational procedures
  • Automate administrative tasks where possible
  • Support system upgrades, capacity planning, and scaling
  • Collaborate with stakeholders and participate in technical planning discussions
  • Monthly status and asset reporting
  • Participation in system health reviews and change control boards
  • Continuous improvement and modernization of systems
  • Maintain high availability and operational readiness of HPCS systems

Benefits

  • health
  • dental
  • vision
  • 401K
  • life insurance
  • short-term and long-term disability plans
  • vacation time
  • holidays
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service