Senior Systems Engineer

The Britton GroupWashington, DC

About The Position

This position supports a mission-critical analytics and research platform built on a Linux-based high-performance computing (HPC) environment. The platform enables advanced statistical modeling and economic research across multiple business lines and federal stakeholders. We are seeking a Senior Systems Engineer with deep expertise in Linux system administration, automation, and HPC environments. This role is ideal for engineers who excel in performance optimization, platform reliability, and supporting highly technical user communities such as data scientists and researchers. You will be responsible for maintaining and enhancing a high-availability analytics platform, providing Tier 3 support, and collaborating with cross-functional teams to deliver scalable and secure solutions that meet evolving analytical demands.

Requirements

  • Administering and maintaining Linux-based server environments in high-performance or enterprise settings
  • Managing high-performance computing (HPC) platforms, including workload scheduling and resource optimization
  • Utilizing automation tools such as Ansible and Ansible Automation Platform for configuration management and system orchestration
  • Monitoring system performance, tuning resources, and ensuring high availability and reliability
  • Providing Tier 3 support for complex system and platform issues in production environments
  • Supporting analytical and statistical workloads using tools such as Python, R, MATLAB, Stata, or SAS
  • Collaborating with data scientists, analysts, and business stakeholders to translate requirements into technical solutions
  • Documenting system configurations, operational procedures, and troubleshooting methodologies
  • Strong experience with Linux system administration and shell scripting
  • Hands-on experience with Ansible for automation and configuration management
  • Experience supporting high-performance computing environments
  • Strong problem-solving skills and customer-focused mindset
  • Ability to work in an on-call rotation supporting mission-critical systems

Nice To Haves

  • Experience with HPC workload managers such as SLURM
  • Familiarity with platforms such as Open OnDemand for HPC user access
  • Experience supporting research, analytics, or data science environments
  • Strong background in system security, vulnerability management, and compliance
  • Experience with performance benchmarking and system optimization
  • Exposure to cloud-based HPC or hybrid computing environments

Responsibilities

  • Design, maintain, and optimize Linux-based HPC infrastructure supporting analytical workloads
  • Perform system updates, patching, and security hardening to ensure compliance and stability
  • Provide Tier 3 support for platform-related issues, ensuring minimal downtime and rapid resolution
  • Collaborate with stakeholders to align system capabilities with analytical and research needs
  • Implement and maintain security controls to protect sensitive data and meet regulatory requirements
  • Conduct system audits, vulnerability assessments, and performance evaluations
  • Participate in platform enhancement initiatives, including upgrades and new feature implementation
  • Contribute to system architecture design and long-term platform strategy
  • Participate in on-call rotation to support critical system availability
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service