Acara-posted 5 months ago
Mid Level
Scottsdale, AZ
Professional, Scientific, and Technical Services

As a Site Reliability Engineer (SRE) for GDMS's Space and Intelligence Systems line of business, you will be a member of a cross functional team responsible for maintaining survivability and reliability of mission critical resources. SREs monitor high priority systems and automate recovery mechanisms to ensure they remain operational for the warfighter.

  • Ensuring Uptime of Critical Systems (Incident Response / Triage)
  • Automating Systems Administration Activities (Bash / Python / Ansible are preferred)
  • Monitoring, and Troubleshooting Enterprise Services (Prometheus, Grafana, Splunk)
  • Configuring Enterprise Services (Ansible, YAML, JSON)
  • Developing recovery procedures for large systems (Backup and Restore, Blue/Green Deployment)
  • Bachelor's Degree in Software Engineering or Science or Mathematics.
  • Minimum 5 years of experience in ensuring the Uptime of Critical Systems (Incident Response / Triage)
  • Minimum 5 years of experience in Monitoring and Troubleshooting Enterprise Services (Prometheus, Grafana, Splunk)
  • Minimum 5 years of experience in Configuring Enterprise Services (Ansible, YAML, JSON)
  • Minimum 5 years of experience in developing recovery procedures for large systems (Backup and Restore, Blue/Green Deployment)
  • Master's Degree
  • Agile experience.
  • Experience in monitoring large-scale systems and using automation to triage emerging issues
  • Automating Systems Administration Activities (Bash / Python / Ansible)
  • Track record of ensuring system uptime demonstrated by diagnosing and triaging complex system-wide incidents
  • Collaborative team player with experience working on teams with diverse engineering skills
  • Thorough knowledge of technology trends and willingness to champion new ideas and process improvement
  • Mixed job experience involving software engineering, systems administration, and network engineering
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service