HPC System Administrator -2 (HPC, Linux)

Akina
Annapolis Junction, MD

About The Position

The System Administrator will provide High Performance Computing (HPC) services in the form of HPC enhanced sustainment capabilities to two geographically dispersed areas. These capabilities include:

  • Systems running Red Hat, CentOS, SUSE, and custom vendor-specific operating systems, with high-speed shared storage (Lustre and GPFS, for example) and dedicated high-speed, low-latency network interconnects such as InfiniBand and Slingshot.
  • Transitioning systems into Operations. This team works across organizations to plan, deliver, and integrate new or additional HPC sustainment capabilities, including HPC systems, servers (cluster, SMP, MPP and SPD), and parallel file systems.
  • System Administrators (HPC) must support the HPC and ABS (ABUNDANTSHIELD high-speed shared parallel storage) SRE teams and follow Government-designated policies and procedures developed to enhance the teams’ ability to perform their sustainment responsibilities and to improve customer mission operations.

Requirements

  • B.S. in a technical discipline and 5 years’ experience as a System Administrator in programs and contracts of similar scope, type, and complexity, or 10 years’ experience in lieu of a degree.
  • Provide Tier 1 (Help Desk) and Tier 2 (Escalation) problem identification, diagnosis, and resolution
  • Provide support to IT systems including day-to-day operations, monitoring and problem resolution for all of the client/server/storage/network devices, mobile devices, etc.
  • Provide support for the escalation and communication of status to agency management and internal customers
  • Provide detailed analysis and feedback to agency management and internal customers for escalated tickets
  • Provide support for the dispatch system and hardware problems, and remain involved in the resolution process
  • DoD 8570 IAT II level certification required.
  • TS/SCI with FSP required.
  • Most recent polygraph must have been completed within the last 7 years.

Responsibilities

  • Provide support for implementation, troubleshooting and maintenance of IT systems
  • Manage the daily activities of configuration and operation of IT systems
  • Provide assistance to users in accessing and using IT systems
  • Optimize system operations and resource utilization, and perform system capacity analysis and planning
  • Apply in-depth experience in troubleshooting IT systems
  • Configure and manage Linux, Unix, and Windows (or other applicable) operating systems; install and load operating system software; troubleshoot, configure, and maintain the integrity of network components; and implement operating system enhancements to improve reliability and performance

Benefits

  • 24 days PTO accrued annually and 11 federal holidays.
  • Our 401k is 100% vested on your start date and the company makes a direct contribution worth 10% of your salary.
  • Akina covers 100% of healthcare costs for employees and 50% toward dependents.
  • We offer educational assistance toward college classes and will cover costs associated with job-related training and certifications.