Senior Linux Administrator

DESE Research, Huntsville, AL

About The Position

DESE Research, Inc. is seeking a Senior Linux Administrator to join our team in Huntsville, AL. In this role the candidate will:

  • Architect & Deploy: Lead the design and lifecycle management of mission-critical Linux workstations, enterprise-grade servers, and high-performance computing (HPC) clusters.
  • Engineer Filesystems: Master the art of data movement. Administer complex local and distributed filesystems (Lustre, GPFS/Spectrum Scale) to ensure extreme-speed access across the fabric.
  • Infrastructure as Code (IaC): Treat the data center as a codebase. Develop sophisticated automation workflows using Python, Bash, and Ansible to eliminate manual toil and ensure drift-free configurations.
  • Defensive Engineering: Implement "Hardened by Design" security. Fine-tune SELinux policies and advanced firewall configurations to protect sensitive data without sacrificing computational performance.
  • Container Orchestration: Modernize scientific workflows by deploying and managing isolated environments using Podman while working to establish a Kubernetes environment.
  • HPC Performance Tuning: Push the limits of the silicon. Optimize cluster scheduling and management utilizing industry-leading tools like Bright Cluster Manager and Slurm.
  • Low-Latency Networking: Configure and optimize high-bandwidth networking, including InfiniBand fabrics, for seamless inter-node communication.
  • Technical Documentation: Author high-fidelity playbooks and strategic architectural diagrams that serve as the blueprint for our evolving infrastructure.

Requirements

  • Bachelor's degree in Computer Science, Math, Engineering, Physics, or a related STEM field
  • Active DoD Top Secret clearance with eligibility for SCI and a CI Scope Polygraph within 180 days of hire
  • Ability and willingness to obtain and maintain Special Access Program (SAP) eligibility
  • Active DoD 8570.01-M baseline certification (Security+ CE, SSCP, or equivalent)
  • Extensive professional experience in Linux systems engineering (RHEL/Rocky preferred)

Nice To Haves

  • Active TS/SCI clearance with a current CI Polygraph
  • RHCE, RHCSA, or similar
  • Direct experience tuning kernel parameters and MPI libraries for large-scale distributed computing
  • Expertise in VMware, Nutanix, or KVM within a heterogeneous environment that includes Windows integration

Responsibilities

  • Lead the design and lifecycle management of mission-critical Linux workstations, enterprise-grade servers, and high-performance computing (HPC) clusters
  • Administer complex local and distributed filesystems (Lustre, GPFS/Spectrum Scale) to ensure extreme-speed access across the fabric
  • Develop sophisticated automation workflows using Python, Bash, and Ansible to eliminate manual toil and ensure drift-free configurations
  • Fine-tune SELinux policies and advanced firewall configurations to protect sensitive data without sacrificing computational performance
  • Deploy and manage isolated environments using Podman while working to establish a Kubernetes environment
  • Optimize cluster scheduling and management utilizing industry-leading tools like Bright Cluster Manager and Slurm
  • Configure and optimize high-bandwidth networking, including InfiniBand fabrics, for seamless inter-node communication
  • Author high-fidelity playbooks and strategic architectural diagrams that serve as the blueprint for our evolving infrastructure

Benefits

  • Competitive health, dental and vision insurance with affordable premiums
  • Flexible work schedules
  • Two different flexible spending account options
  • Company-paid life insurance with options for additional employee-paid coverage
  • Performance bonus program
  • Education reimbursement program
  • Company-paid personal leave for approved philanthropic activities
  • Vacation, sick, and holiday leave
  • Robust 401(k) profit sharing plan
  • Opportunities for internal promotions
  • Employee referral incentive program
  • Rewards and gifts for service anniversaries