Senior Reliability Engineer

Mastercard
O'Fallon, MO
$96,000 - $163,000

About The Position

Mastercard is committed to connecting and powering an inclusive, digital economy that benefits everyone, everywhere. The company focuses on making transactions safe, simple, smart, and accessible. Through secure data, global networks, strong partnerships, and a passion for innovation, Mastercard helps individuals, financial institutions, governments, and businesses reach their full potential.

A core part of Mastercard’s identity is its Decency Quotient (DQ), a cultural foundation that guides how employees work, collaborate, and lead. Mastercard fosters a culture of inclusion that respects individual strengths, perspectives, and experiences. The company believes that diversity drives better decisions, fuels innovation, and leads to stronger business outcomes.

Technology at Mastercard shapes the future of the digital economy. The organization builds revolutionary systems that make global commerce more connected, inclusive, sustainable, and secure. Mastercard technologists work on a truly global network, creating critical systems and products that enable people everywhere to access essential goods and services. The culture is inclusive, diverse, and collaborative: it celebrates strengths, values experience, and offers flexibility to build a career across disciplines and continents. Employees work alongside experts and leaders at all levels, improving existing systems and inventing what comes next.

The Business Operations (Biz Ops) team serves as the production readiness steward for Mastercard products. The mission of the Business Operations Site Reliability Engineer (SRE) / Operational Readiness Architect is to ensure platform stability, health, and resilience.

Requirements

  • BS in Computer Science or related technical field, or equivalent practical experience.
  • Curiosity and appetite for automation, new technologies, and scalable architectures.
  • Strong problem‑solving skills, communication abilities, ownership, and drive.
  • Interest in large‑scale distributed systems design, analysis, and troubleshooting.
  • Ability to work in diverse, matrix‑based, geographically distributed teams.
  • Ability to balance long-term system health against short-term fixes.
  • Ability to collaborate cross-functionally with a clear understanding of expected system behavior and monitoring needs.
  • Experience with industry-standard CI/CD tools such as Git/Bitbucket, Jenkins, Maven, Artifactory, and Chef. Experience designing and implementing an effective, efficient CI/CD flow that gets code from dev to prod with high quality and minimal manual effort is desired.
  • Experience in one or more of the following is preferred: C, C++, Java, Python, Go, Perl or Ruby.
  • Ability to work shifts and weekends when needed, based on team members' rotations and schedules.

Nice To Haves

  • Experience with algorithms, data structures, scripting, pipeline management, and software design.
  • Experience working across development, operations, and product teams.
  • Prior SRE experience.
  • Expertise in RDBMS such as PostgreSQL and Oracle.
  • Proficiency in SQL, PL/SQL, and PostgreSQL features.
  • Strong understanding of database architecture, performance tuning, and query optimization.
  • Experience with monitoring tools (e.g., Splunk, Dynatrace).
  • Experience in production support and ITIL processes.
  • Experience with CI/CD tools: Git/Bitbucket, Jenkins, Maven, Artifactory, Groovy, Chef.
  • Understanding of:
      • Client‑server relationships
      • Network concepts (Layer 1–3)
      • Stack trace analysis (TCP dumps, heap/CPU/memory/thread dumps)
      • Load balancers and application firewalls
      • Operating system navigation
      • Logging and monitoring standards
      • High availability and business continuity
      • Caching concepts
      • Configuration management
  • Awareness of security implementations, certificate lifecycle management, mutual TLS, SSL handshake, SSH keys, and encryption methods (symmetric/asymmetric).

Responsibilities

  • Foster developer ownership and empower teams to build resilient, fault‑tolerant, scalable products.
  • Support developers during the build phase with operational design, automation, capacity planning, and monitoring.
  • Establish and enforce operational standards while promoting an agile, learning‑focused culture.
  • Lead triage and root‑cause analysis with a focus on business impact and blameless post‑mortems.
  • Engage early in the development lifecycle to proactively manage production and change activities.
  • Drive risk management, compliance, and mitigation across environments.
  • Align product and customer priorities with operational needs through continuous feedback.
  • Support the application CI/CD pipeline for promoting software into higher environments through validation and operational gating, and lead Mastercard in DevOps automation and best practices.
  • Practice sustainable incident response and blameless post-mortems.
  • Take a holistic approach to problem solving by connecting the dots, during a production event, across the technology stacks that make up the platform, to optimize mean time to recover.
  • Work with a global team spread across tech hubs in multiple geographies and time zones.
  • Share knowledge and mentor junior team members.
  • Serve as the primary contact for application health, performance, and capacity.
  • Support services before launch through system design consulting, capacity planning, and launch reviews.
  • Partner with development and product teams to define monitoring and alerting strategies.
  • Build frameworks that enable zero‑downtime deployments.
  • Analyze ITSM activities and provide feedback to development teams on operational gaps and resiliency concerns.

Benefits

  • insurance (including medical, prescription drug, dental, vision, disability, life insurance)
  • flexible spending account and health savings account
  • paid leaves (including 16 weeks of new parent leave and up to 20 days of bereavement leave)
  • 80 hours of Paid Sick and Safe Time, 25 days of vacation time and 5 personal days, pro-rated based on date of hire
  • 10 annual paid U.S. observed holidays
  • 401k with a best-in-class company match
  • deferred compensation for eligible roles
  • fitness reimbursement or on-site fitness facilities
  • eligibility for tuition reimbursement