Associated Universities Inc.-posted about 1 month ago
Full-time • Mid Level
Socorro, NM
501-1,000 employees

The National Radio Astronomy Observatory (NRAO) is a prestigious research and development organization that plays a vital role in the study of the universe. The Observatory is a hub for technological and scientific collaboration, operating state-of-the-art radio telescope facilities for use by the international scientific community. The NRAO also host conferences and workshops, providing opportunities to exchange ideas and expertise as well as build partnerships. The National Radio Astronomy Observatory seeks an experienced Systems Administrator (Level III or IV, based on qualifications) to maintain the Red Hat Enterprise Linux infrastructure supporting the end-to-end science data pipeline for NRAO’s flagship observatories. This senior role combines deep systems expertise with operational ownership, mentoring, and direct contribution to mission-critical science delivery. The position is based in Albuquerque or Socorro, NM.

  • Contribute to the design, implementation, and lifecycle management of RHEL-based systems supporting processing and archival science data flows across global observatories.
  • Lead the transition to GitOps-driven infrastructure and application deployment, striving for consistency, auditability, and reproducibility.
  • Migrate legacy science services from Docker Swarm to future environments based in kubernetes.
  • Develop and maintain automation tools in Python and SQL to monitor data pipeline health, generate operational metrics, and trigger reliable alerts.
  • Serve as Level-3 escalation for production incidents; conduct root-cause analysis, author post-mortem reports, and implement preventive measures.
  • Triage and resolve escalated support tickets, providing timely, astronomer-facing status communications during incidents.
  • Participate in agile development cycles (2-week sprints, daily stand-ups, Jira/Confluence) to deliver measurable improvements in stakeholder projects.
  • Validate software releases, prepare deployment packages, and produce comprehensive user documentation and training materials.
  • Contribute to the NRAO Common Computing Environment (CCE) initiative for cross-site standardization and knowledge transfer.
  • Mentor junior and peer administrators in infrastructure-as-code, automation, and operational best practices.
  • Travel occasionally to NRAO sites, including the Very Large Array (VLA), Atacama Large Millimeter/submillimeter Array (ALMA) in Chile, and international operations centers.
  • You have a Bachelor’s degree in Computer Science, Information Systems, Astronomy, Physics, or equivalent professional experience.
  • You are a seasoned Linux systems administrator with at least four years of progressive responsibility in mission-critical or scientific computing environments
  • Expert in Red Hat Enterprise Linux 8/9; RHCE or RHCA certification strongly preferred.
  • Proficient in modern infrastructure automation and orchestration: Ansible Automation Platform (playbooks, collections, Execution Environments) GitOps workflows using ArgoCD or Flux Production container platforms (Kubernetes/OpenShift and Docker Swarm)
  • Skilled in Python 3 automation and SQL (PostgreSQL, Oracle, MySQL) for monitoring, reporting, and system health.
  • Experienced with Telegraf, Prometheus, and alerting systems in operational settings.
  • A clear communicator capable of translating complex technical conditions into concise updates for astronomers, project leads, and senior management.
  • Observatory employees must be authorized to work in the United States. The Observatory presently cannot sponsor H-1B Visas for this position
  • Must be able to lift 25 lbs, climb stairs, and occasionally work at moderate altitudes (up to 7,000 ft / 2,134 m at the VLA site).
  • Direct experience with high-data-rate scientific pipelines (radio astronomy, genomics, earth observation, or similar).
  • Working knowledge of VictoriaMetrics, Ceph, SLURM, Prometheus/Grafana/Loki stacks.
  • Familiarity with both agile (Scrum/Kanban) and traditional waterfall project methodologies.
  • strong communication skills (written and verbal); ability to remain calm while supporting demanding clients; analytical thinker; ability to learn new systems quickly.
  • AUI provides excellent paid time off (13 holidays, annual accrual of up to 24 vacation days and 15 sick days, additional time off for doctor/dentist visits, and 8 weeks of paid parental leave).
  • Medical, dental and vision plans are effective on the first day of employment.
  • AUI’s retirement benefit contributes an amount equal to 10 percent of a qualified participant’s base pay with no required employee contribution; we also offer an optional supplemental, tax-deferred plan for employee retirement contributions.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service