CSCI Consulting • Posted 5 months ago
Full-time • Mid Level
Quantico, VA
101-250 employees

CSCI Consulting is looking for a Site Reliability Engineer (SRE) to ensure the performance, availability, and security of mission-critical enterprise systems in a secure federal environment. This role combines deep systems engineering knowledge with DevOps automation, proactive monitoring, and incident response practices. The ideal candidate has hands-on experience with cloud platforms, automation tools, and secure infrastructure operations. You will work closely with cross-functional teams to enhance system reliability, manage complex integrations, and support continuous delivery while ensuring compliance with DoD cybersecurity standards.

Responsibilities

  • Maintain and optimize enterprise infrastructure performance, uptime, and reliability across hybrid environments
  • Develop and manage automation for infrastructure provisioning, configuration, and deployment using tools like Terraform, Ansible, Jenkins, or GitLab CI
  • Implement and manage monitoring, alerting, and log analysis using tools such as Splunk, Prometheus, and Grafana
  • Develop and maintain scripts in Python, Bash, or PowerShell for automation, diagnostics, and incident response
  • Support performance tuning and root cause analysis (RCA) for infrastructure and application-level issues
  • Ensure compliance with IAVA, STIGs, and other DoD cybersecurity standards
  • Collaborate with Agile teams to support sprint planning, retrospectives, and technical execution
  • Participate in disaster recovery, backup/restore, and incident management processes
  • Support interface monitoring and ensure data integrity across integrated ERP and enterprise systems

Qualifications

  • Bachelor’s degree in Computer Science, Information Systems, or a related technical field
  • 4–6 years of experience in site reliability engineering, DevOps, or systems/network engineering
  • Experience maintaining enterprise infrastructure with a focus on performance, security, and uptime
  • Proficiency with CI/CD tools and automation frameworks (e.g., Jenkins, GitLab CI, Ansible, Terraform)
  • Strong scripting ability using Python, Bash, or PowerShell
  • Hands-on experience with monitoring/logging tools (e.g., Splunk, Prometheus, Grafana)
  • Familiarity with cloud platforms (e.g., AWS, Oracle Cloud Infrastructure [OCI]) and virtualization (e.g., VMware)
  • Understanding of disaster recovery, backup, and incident response procedures
  • Active DoD Secret clearance or the ability to obtain one
  • Creativity and adaptability in problem-solving
  • Ability to work with clients to understand their needs
  • Strong organizational and time-management skills
  • Excellent written and verbal communication skills
  • Professional presence
  • Experience supporting DoD or federal systems in classified or mission-critical environments
  • Familiarity with Oracle Exadata, ZFS storage, and InfiniBand networking
  • Knowledge of DoD cybersecurity frameworks, including STIGs and IAVA compliance
  • Experience with performance tuning and RCA in high-availability systems
  • Experience in Agile/SAFe teams and participation in sprint planning, backlog grooming, and retrospectives
  • Exposure to interface monitoring, system integrations, and data validation in complex enterprise environments
  • Ability to work in a team environment, as well as independently
  • Strong customer and vendor relationship skills
  • Demonstrated ability to comply with data standards and policies
  • Motivation to learn new technologies and methodologies that demonstrate value
  • Experience working with a federal agency
  • Department of Defense experience is a plus!

Benefits

  • Competitive salaries
  • Generous Paid Time Off (PTO) package
  • Paid holidays aligned to the Federal calendar
  • Full health benefits including medical, dental, vision, and life insurance
  • 401(k) retirement plan
  • Team building events
  • Professional development support