Senior Systems Administrator

Point Solutions Group, Herndon, VA

About The Position

PSG is seeking an experienced Senior Linux Systems Administrator (SME) to support mission-critical environments, including a Centralized Supercomputing Facility (CSCF) High Performance Computing (HPC) environment and other government programs. In this role, you will support large-scale Linux infrastructure, troubleshoot complex system issues, and help maintain highly secure, high-performance computing environments. You'll work closely with engineers, support teams, and mission users to ensure systems remain stable, secure, and optimized for performance. This is an excellent opportunity for a hands-on Linux expert who enjoys solving complex system problems and supporting advanced computing platforms.

Locations: Littleton, CO • King of Prussia, PA / Valley Forge, PA • Herndon, VA • Springfield, VA

Requirements

  • Active TS/SCI with CI Polygraph (required to start)
  • Bachelor's degree in Computer Science, Information Systems, or a related field
  • 10+ years of professional experience in Linux systems administration or a related field
  • At least 5 years of experience working in heterogeneous, multi-platform computing environments
  • Strong experience with Linux system administration and troubleshooting
  • Proficiency with Bourne/Bash scripting and Expect scripting
  • Experience debugging Linux kernel issues, including reviewing source code and analyzing system logs
  • Experience using Linux debugging and performance tools such as crash and systemtap
  • Experience managing Satellite repositories for patching and kernel updates
  • Experience automating system configuration and maintenance using Ansible and Satellite Server
  • Experience with identity management systems, including LDAP and token-based authentication

Nice To Haves

  • Experience supporting High Performance Computing (HPC) environments
  • Experience working in classified or highly secure environments
  • Familiarity with large-scale Linux infrastructure or supercomputing systems
  • Experience improving system reliability and automation across enterprise environments

Responsibilities

  • Provide Linux system administration expertise supporting HPC and enterprise environments
  • Troubleshoot complex system and kernel-level issues using debugging tools and system logs
  • Support High Performance Computing (HPC) environments, including system performance and stability
  • Develop and maintain automation scripts to improve system management and operational efficiency
  • Manage Linux patching and package repositories using Satellite Server
  • Build and maintain custom Linux kernels to address functionality or security requirements
  • Automate system configuration and maintenance using tools such as Ansible
  • Support and maintain identity and access management systems, including LDAP and token-based authentication
  • Collaborate with engineering and infrastructure teams to ensure system security, availability, and performance
  • Assist with diagnosing system outages, performance bottlenecks, and operational issues