Observability and Monitoring Analyst

CapgeminiMcLean, VA
$120,000 - $135,000

About The Position

Capgemini Government Solutions (CGS) LLC is seeking highly motivated and experienced Observability and Monitoring Analyst to join our team to support our government clients. You will have the chance to use and develop your skills, work with a driven and innovative team, engage with different collaborators, and strengthen CGS capabilities. The successful applicant will have the opportunity to apply and grow their skillset, work with a motivated and entrepreneurial team, engage with a wide range of stakeholders, and build CGS capabilities to serve our clients.

Requirements

  • Active TS/SCI clearance – Must Have
  • 10+ years of relevant work experience
  • Skills are focused on infrastructure health telemetry (CPU, memory, storage), Log analysis, infrastructure architecture, log correlation and system health interpretation. Network health interpretation and user activity review.
  • Cybersecurity proficiency.
  • Working within high visibility and mission critical aspects of a program.
  • Experience configuring and supporting Openobserve, Splunk, Elastic, fluentd, and Grafana.
  • Experience with cloud automation, specifically autoscale, configuring autoscale triggers and AI analytics with anomaly detection.
  • Experience with React, AI foundation models, Cloud automation templates, and API management.

Nice To Haves

  • SAFe Scrum Master (SSM) or SAFe Advanced Scrum Master (SASM) certification – Preferred

Responsibilities

  • Design, implement and maintain observability frameworks that provide visibility into system performance, available and reliability. This includes: Key logging, Session replay, AI-assisted operations, User activity monitoring
  • Design, build and manage an open-source project observability stack.
  • Optimize monitoring coverage, ensuring alignment with SLAs.
  • Collaborate with DevOps, SRE, and development teams to ensure systems are observable by design.
  • Perform anomaly detection and alerting on threats, issues or trends.
  • Perform root cause analysis (RCA) and contribute to post-incident reviews.
  • Manage data streams and log ingestion from multiple sources.
  • Develop dashboards, alerts, and reports. This includes: Tracking and reporting on user and system activities, Monitoring system health across the environment, Leveraging AI/ML for recommendations and predictions
  • Ensure data collection has appropriate logs, data is tagged, and correlation is built.
  • Ensure compliance with data privacy regulations.
  • Ensure new releases do not impact the environment.
  • Ensure the Agile process is followed, including implementing and supporting Agile SAFe principles and practices.
  • Oversee the efforts of less senior staff.

Benefits

  • paid time off
  • medical/dental/vision insurance
  • 401(k)
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service