Lead Site Reliability Engineer

Mastercard
O'Fallon, MO
Onsite

About The Position

The Mastercard Business Operations (BizOps) organization is seeking a Lead BizOps Engineer to serve as a technical authority and operational architect across critical platforms. This role is designed for a senior individual contributor who thrives at system‑level thinking, drives SRE and DevOps maturity at scale, and influences outcomes across programs and portfolios. As a Lead BizOps Engineer, you will operate beyond a single application or team, shaping reliability strategy, defining standards, and elevating operational excellence across Mastercard’s most business‑critical services. You will partner deeply with product engineering, architecture, security, and leadership to ensure platforms are designed, delivered, and operated with resilience, scalability, and customer trust at their core. BizOps is at the forefront of Mastercard’s Operational Resilience evolution, driving modern tooling, standardized practices, and consistent operating models across the enterprise.

Mission

BizOps acts as the production readiness and operational resilience steward for Mastercard platforms. As a Lead BizOps Engineer, your mission is to embed reliability, operability, and compliance into platform design and delivery, ensuring services are:

  • Highly available, resilient, and performant
  • Observable, self‑healing, and automation‑driven
  • Secure, compliant, and auditable by design
  • Operated through repeatable, scalable, low‑toil processes

You will provide continuous feedback loops into engineering and product teams, ensuring lessons learned from production meaningfully improve future designs and customer experience.

What We Do in BizOps

We deliver this mission through:

  • Deep incident ownership with rigorous root‑cause analysis tied to business impact
  • A shift‑left operational mindset, influencing architecture and design before code reaches production
  • Enterprise‑grade risk management, controls, and compliance oversight
  • Standardized and streamlined support models that reduce friction for partners
  • Bridging product intent and operational reality to deliver reliable, customer‑centric platforms

At the Lead level, you are expected to shape these practices, not just execute them.

Requirements

  • Bachelor’s degree in Computer Science, Engineering, or a related technical discipline, or equivalent practical experience.
  • Deep expertise in distributed systems, reliability engineering, and production operations.
  • Strong foundation in algorithms, data structures, system design, and automation.
  • Advanced troubleshooting skills across the full technology stack.
  • Proven ability to drive decisions and outcomes in high‑pressure, high‑impact environments.
  • Solid understanding of databases, blob stores such as S3, and load balancers.
  • Proficiency in one or more of: Python, Go, Bash.
  • Extensive hands‑on experience with DevOps and observability tooling such as Git / Bitbucket, Jenkins / XLR, Chef, Ansible, Splunk, and Dynatrace, or similar tools.
  • Demonstrated success building and scaling CI/CD pipelines with minimal manual intervention.

Nice To Haves

  • Certificate Management, PKI, Authentication & Authorization
  • LDAP, Active Directory Services, Access Provisioning Controls, Audit, and Compliance frameworks
  • SOAP and REST APIs and integration patterns

Responsibilities

  • Act as a Lead‑level technical authority for reliability, operability, and production readiness across multiple platforms or programs.
  • Influence system architecture, design patterns, and platform standards to improve resiliency, scalability, and fault tolerance.
  • Partner with engineering and architecture teams during pre‑production and roadmap phases to guide capacity planning, failure modeling, and launch readiness.
  • Challenge designs constructively, advocating for operational simplicity, automation, and sustainable on‑call models.
  • Own and evolve availability, latency, performance, and reliability objectives for critical systems.
  • Lead complex production events and cross‑platform investigations, reducing MTTR through systemic fixes, not workarounds.
  • Champion blameless postmortems, ensuring remediation actions translate into measurable reliability improvements.
  • Identify recurring failure patterns and drive engineering‑led elimination of toil.
  • Provide leadership for CI/CD strategy, ensuring pipelines support automated validation, risk‑based gating, and safe, repeatable deployments.
  • Drive adoption of automation‑first practices across build, deploy, test, recovery, and compliance workflows.
  • Influence DevOps standards across teams, enabling consistent, high‑quality software delivery at scale.
  • Define and promote standards for monitoring, alerting, SLOs, and telemetry.
  • Enable proactive detection, predictive alerting, and self‑healing capabilities across platforms.
  • Ensure observability is treated as a first‑class architectural requirement, not an afterthought.
  • Partner with security, risk, and compliance teams to embed controls, auditability, and regulatory requirements into platform design and operations.
  • Ensure operational practices meet Mastercard’s enterprise risk and compliance expectations across all environments.
  • Mentor senior and junior engineers, raising the technical bar across the BizOps community.
  • Contribute to guild initiatives, standards, whitepapers, and best‑practice guidance.
  • Influence leaders and peers through data, experience, and clear technical narratives.
  • Represent BizOps in cross‑organizational forums as a trusted advisor on reliability and operations.

Benefits

  • insurance (including medical, prescription drug, dental, vision, disability, life insurance)
  • flexible spending account and health savings account
  • 16 weeks of new parent leave
  • up to 20 days of bereavement leave
  • 80 hours of Paid Sick and Safe Time
  • 25 days of vacation time
  • 5 personal days
  • 10 annual paid U.S. observed holidays
  • 401k with a best-in-class company match
  • deferred compensation for eligible roles
  • fitness reimbursement or on-site fitness facilities
  • eligibility for tuition reimbursement