Site Reliability Engineer - TS/SCI with Poly

GD Information TechnologyAnnapolis Junction, MD
Onsite

About The Position

As a Site Reliability Engineer (SRE) supporting the CIO Infrastructure Services (CIS) program, you will help maintain the reliability, scalability, and performance of enterprise infrastructure services deployed across more than 250 global sites. You will engineer and optimize systems, automate operational workflows, strengthen monitoring capabilities, and ensure the stability and resilience of mission critical‑ environments. You will partner closely with Engineering, Operations, Tech Refresh, Cybersecurity, and Data Center teams to ensure seamless integration of new capabilities into a high availability production environment, helping the Defense Intelligence Enterprise remain secure, connected, and ‑mission ready‑.

Requirements

  • Active TS/SCI with CI Polygraph
  • Bachelor’s degree in computer science, engineering, IT, or related technical field (Additional experience may substitute for degree)
  • 5+ years of experience in site reliability engineering, systems engineering, enterprise operations, or DevOps roles
  • Hands‑on experience with automation tools (PowerShell, Python, Ansible, Terraform, etc.)
  • Strong experience supporting enterprise infrastructure domains including server compute, storage, virtualization, networking, and monitoring
  • Experience with enterprise monitoring platforms (e.g., SolarWinds, SCOM, Splunk, Nagios, ELK)
  • Strong understanding of ITIL/ITSM workflows and operational governance processes
  • Demonstrated ability to troubleshoot complex technical issues across distributed enterprise environments
  • Strong communication and collaboration skills working across multidisciplinary technical teams
  • Excellent communication and stakeholder engagement skills
  • US citizenship required

Nice To Haves

  • ITIL v4 Foundations certification
  • Experience supporting the client, DoDIIS, or Intelligence Community environments
  • Familiarity with CMMC, NIST 800‑53, policies, and RMF processes
  • Experience with ServiceNow/Service Central and automated ticketing workflows
  • Experience supporting hybrid cloud, virtual desktop infrastructure (VDI), or hyperconverged platforms

Responsibilities

  • Ensure the reliability, availability, and performance of enterprise IT systems across global environments
  • Develop automation solutions that reduce manual effort, streamline operational tasks, and improve system resiliency
  • Build and maintain monitoring, alerting, and observability capabilities supporting 24/7/365 enterprise operations
  • Perform root cause analysis (RCA), corrective action planning, and long-term‑ problem remediation for infrastructure issues
  • Partner with engineering teams to validate, test, and integrate new systems, upgrades, baselines, and enhancements into production
  • Improve system performance through configuration tuning, capacity planning, and optimization of compute, storage, network, and virtualized environments
  • Develop and maintain infrastructure-as-code, scripts, and operational automation to support consistent and repeatable deployments
  • Support enterprise incident response, including triage, escalation, and service restoration for high visibility‑ events
  • Maintain operational documentation including SOPs, runbooks, baselines, dashboards, and architectural diagrams
  • Ensure compliance with ITIL/ITSM processes—including Incident, Problem, Change, and Configuration Management
  • Strengthen the enterprise security posture by supporting patching, vulnerability remediation, and RMF related‑ configuration updates
  • Coordinate with global operations teams to ensure service continuity, readiness, and adherence to SLAs and KPIs
  • Leverage analytics, metrics, and monitoring data to identify performance trends and drive continuous service improvement initiatives

Benefits

  • Comprehensive benefits and wellness packages
  • 401K with company match
  • Competitive pay
  • Paid time off
  • Full flex work weeks where possible
  • Variety of paid time off plans, including vacation, sick and personal time, holidays, paid parental, military, bereavement and jury duty leave
  • Short and long-term disability benefits
  • Life, accidental death and dismemberment, personal accident, critical illness and business travel and accident insurance
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service