Senior Integration - Site Reliability Engineer (SRE)

CVS HealthHartford, CT
2dHybrid

About The Position

We’re building a world of health around every individual — shaping a more connected, convenient and compassionate health experience. At CVS Health®, you’ll be surrounded by passionate colleagues who care deeply, innovate with purpose, hold ourselves accountable and prioritize safety and quality in everything we do. Join us and be part of something bigger – helping to simplify health care one person, one family and one community at a time. Position Summary We are seeking a Senior Integration SRE to lead the reliability, availability, and scalability of our integration platforms—including various API gateways, event streams, messaging systems, and data pipelines. In this role, you will operate mission-critical systems that connect internal and external applications. You will define SLOs/SLIs, build resilience patterns, improve observability, and elevate engineering teams’ production readiness practices. This is a technical leadership role for an engineer who thrives on solving complex distributed systems challenges, manage offshore 24/7 operations team, improving reliability through automation, and driving cross-team alignment on platform and operational excellence. Success Indicators - 99.9% availability for integration systems. - Improved p95 latency across APIs. - Error budgets consistently maintained. - MTTR improves quarter over quarter. - Automation and AI coverage increase by 50%. - Organization adopts standardized patterns for integration reliability. Candidates can be located anywhere in the US and work in hybrid or remote model. Preferred locations for hybrid model - CT, RI, IL, AZ, TX

Requirements

  • 7+ years experience in SRE/Production/Integration Engineering.
  • 5+ years experience with Kubernetes, Gateways like APIC, Apigee, Kong and APIM.
  • 5+ years of experience in secure API integration patterns, including OAuth 2.0, JWT validation, IP filtering and Private endpoint.
  • 5+ years of experience with traffic optimization techniques such as caching, throttling, and request/response transformation.
  • 7+ years of experience defining and managing SLIs, SLOs, and SLAs for API platform.
  • 7+ years of experience in Incident response, Root cause analysis and Post-incident reviews/learning.
  • 6+ years of experience in working cross‑functionally with development, security, and product teams.
  • 6+ years of experience with proven incident response leadership.
  • 3+ years of experience working in 24/7 /on call support.

Nice To Haves

  • Experience designing idempotent and fault-tolerant integration patterns.
  • Experience with distributed tracing.
  • Background in regulatory industries (FinTech/Healthcare/etc.).
  • Hands on experience with APIC (IBM API Connect) or any API Management Platform
  • Excellent interpersonal and communication skills to work with all levels
  • Strong organizational, leadership and consensus building skills; ability to motivate and lead teams in matrix organization
  • Experience in multiple technologies in stack (DataPower, ACE, MQ and Splunk) is a PLUS
  • Healthcare experience or big box retail experience is a significant plus and will be given utmost consideration
  • AIOps experience

Responsibilities

  • Lead the reliability, availability, and scalability of our integration platforms—including various API gateways, event streams, messaging systems, and data pipelines.
  • Operate mission-critical systems that connect internal and external applications.
  • Define SLOs/SLIs
  • Build resilience patterns
  • Improve observability
  • Elevate engineering teams’ production readiness practices.
  • Solving complex distributed systems challenges
  • Manage offshore 24/7 operations team
  • Improving reliability through automation
  • Driving cross-team alignment on platform and operational excellence.

Benefits

  • Affordable medical plan options, a 401(k) plan (including matching company contributions), and an employee stock purchase plan.
  • No-cost programs for all colleagues including wellness screenings, tobacco cessation and weight management programs, confidential counseling and financial coaching.
  • Benefit solutions that address the different needs and preferences of our colleagues including paid time off, flexible work schedules, family leave, dependent care resources, colleague assistance programs, tuition assistance, retiree medical access and many other benefits depending on eligibility.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service