Infrastructure Manager (Linux/HCI/Jira-Driven Operations)

ASRC Federal
Mountain View, CA
Onsite

About The Position

ASRC Federal InuTeq is seeking a highly capable Infrastructure Manager in Mountain View, CA to support the NASA Advanced Supercomputing (NAS) facility, home to some of the world’s fastest supercomputers, including Athena, Aitken, Cabeus, and Electra. This role operates within a heavily Linux-based ecosystem, leveraging Hyperconverged Infrastructure (HCI) and Jira-based project and task management to drive operational excellence. The ideal candidate brings a balance of people leadership, technical depth, and execution discipline, with a proven ability to manage 24x7 operations while delivering strategic infrastructure improvements.

Requirements

  • U.S. Citizenship required; must be able to pass NASA background checks
  • Bachelor’s degree in computer science, engineering, or a related technical field (or equivalent experience)
  • 7+ years supporting Linux-based, networked computing environments
  • 3+ years leading technical teams in a production environment
  • Strong experience with: Linux system administration (RHEL, Ubuntu, or similar)
  • Strong experience with: Hyperconverged Infrastructure (e.g., VMware vSAN, Nutanix, or similar)
  • Strong experience with: Automation tools (e.g., Ansible, scripting, or equivalent)
  • Strong experience with: OS patching and lifecycle management
  • Proven experience managing projects using Jira or MS Project
  • Experience working with vendors on technology evaluation and procurement
  • Strong understanding of operational risk management and high availability
  • Excellence in building and managing relationships
  • Ability to establish client trust and value

Nice To Haves

  • Experience in government or regulated environments (e.g., NASA, DoD, or federal programs)
  • Programming or scripting experience (e.g., Python, Bash, C, C++, Java, Perl)
  • Linux certifications
  • Familiarity with ITIL-based service management practices
  • Exposure to DevOps practices and infrastructure as code (IaC)
  • Experience implementing monitoring and observability platforms

Responsibilities

  • Lead, mentor, and develop a team of system administrators supporting a 24x7 Linux-based (Red Hat) infrastructure environment
  • Foster a mission-driven culture centered on reliability, accountability, and continuous improvement
  • Plan staffing, on-call rotations, and coverage models to ensure uninterrupted operations
  • Drive performance management, hiring, coaching, and career development activities
  • Ensure all contractual and program deliverables are executed on time and aligned with stakeholder expectations
  • Manage a team of 4–5 system administrators, providing day-to-day supervision, technical direction, training, documentation, and support to ensure operational excellence across virtual machines, physical servers, and end-user systems in a Linux-dominant environment
  • Establish and enforce operational priorities focused on availability, performance, and system stability
  • Lead incident response, outage management, and service restoration efforts
  • Coordinate with stakeholders to balance risk, performance, and mission requirements
  • Ensure consistent execution of patching, OS lifecycle management, and system hardening practices
  • Drive infrastructure projects using Jira for planning, tracking, and reporting across tasks, sprints, and deliverables
  • Lead cross-functional initiatives involving HCI platforms, automation, monitoring, and system modernization
  • Ensure transparency and accountability through well-managed backlogs, workflows, and reporting metrics
  • Partner with the HPC operations team, security, and stakeholders to deliver projects on time and within budget
  • Promote disciplined change management (Remedy) and formal request processes
  • Provide technical and strategic oversight for Linux-based platforms and services; Hyperconverged Infrastructure (HCI) environments; and monitoring, logging, and automation frameworks
  • Define and standardize system configurations, operational procedures, and lifecycle management practices
  • Identify and implement opportunities to improve efficiency, scalability, and reliability
  • Ensure consistency across infrastructure stacks and operational processes

Benefits

  • Strong career development
  • Practical technical training

What This Job Offers

  • Job Type: Full-time
  • Career Level: Mid Level
  • Number of Employees: 501-1,000
