Senior Reliability Engineer

Fitch Group, Chicago, IL
Hybrid

About The Position

Fitch Group is seeking a Senior Service Reliability Engineer to join the Fitch Solutions development squads in Chicago, USA. As a global financial information services provider, Fitch Group offers credit and risk insights, data, and tools. The Technology & Data Team is a dynamic department focused on innovation and leveraging cutting-edge technologies like AI and cloud solutions. Fitch Solutions SRE provides Service Reliability Engineering expertise to Fitch’s development organizations, acting as subject matter experts in cloud technologies, systems engineering, infrastructure automation, and DevOps tooling. This role is dedicated to ensuring excellence in Fitch Solutions services, with a focus on new AI development.

Requirements

  • Deep, hands-on experience in SRE, DevOps, or Platform Engineering across both AWS and Azure.
  • Strong track record operating Docker and Kubernetes in production environments.
  • Highly proficient in administering both Linux and Windows.
  • Practical, enterprise-level experience supporting IIS/.NET applications as well as Java Spring Boot services.
  • Experience building and maintaining CI/CD pipelines (primarily GitHub Actions; Bamboo experience a plus) with DevSecOps principles baked in—integrating security scans, policy-as-code, and compliance gates.
  • Confident scripting in Python, PowerShell, or Bash.
  • Experience with cloud security best practices (IAM, secrets management, container/image scanning).
  • Solid understanding of core infrastructure fundamentals (networking, storage, DNS) and APM/telemetry tooling.

Nice To Haves

  • Practical experience with agentic AI for operations—incident triage, runbooks, and change management—with clear guardrails, auditability, and human-in-the-loop controls.
  • Supporting AI/ML workloads at scale: SageMaker endpoints, GPU node groups, autoscaling, and Kubernetes-based model serving.
  • Policy-as-code (OPA) and compliance implementation across CIS, NIST, ISO 27001, with automated remediation integrated via CSPM tools (e.g., Wiz).
  • Applying AI in CI/CD, observability, and incident response using AWS Bedrock/SageMaker and Model Context Protocol (MCP).
  • Hands-on Agile delivery experience, actively participating in stand-ups and sprint ceremonies.

Responsibilities

  • Lead the delivery of reliable, scalable, mission-critical services.
  • Guide squads on Kubernetes and modern deployment patterns.
  • Mentor associate engineers and set best practices.
  • Partner closely with Fitch Solutions Development Squads and Operations to design and advance service builds and DevOps tooling, and to drive operational excellence.
  • Partner with Core Engineering to architect and govern GitHub Actions CI/CD with quality gates, canary/blue‑green strategies, and AI‑assisted redeploy checks.
  • Own observability in Datadog—define SLIs/SLOs, dashboards, alerting, and MS Teams integrations—and reduce incidents via telemetry-driven automation and blameless postmortems.
  • Champion AI‑enabled operations using AWS Bedrock/SageMaker and Model Context Protocol (MCP) for log analysis, anomaly detection, incident triage, and workflow orchestration; establish adoption guardrails.
  • Define and enforce cloud guardrails and security controls (SCPs/IAM boundaries, OPA policies, tagging, centralized logging with AWS Config/CloudTrail/Security Hub) in partnership with Security and Risk.
  • Influence cross‑functional roadmaps, lead complex release planning, and drive strategic platform initiatives across CI&PE; serve as an escalation point and participate in the L3 on‑call rotation.

Benefits

  • Hybrid Work Environment: 2 to 3 days a week in office required based on your line of business and location
  • A Culture of Learning & Mobility: Dedicated trainings, leadership development and mentorship programs
  • Investing in Your Future: Retirement planning and tuition reimbursement programs
  • Promoting Health & Wellbeing: Comprehensive healthcare offerings
  • Supportive Parenting Policies: Family-friendly policies, including a generous global parental leave plan
  • Inclusive Work Environment: A collaborative workplace where all voices are valued, with Employee Resource Groups
  • Dedication to Giving Back: Paid volunteer days, matched funding for donations and ample opportunities to volunteer in your community