Senior BizOps Engineer

MastercardO'fallon, MO
96d$94,000 - $157,000

About The Position

The Reactive Systems Architecture BizOps team is looking for a BizOps Engineer who can help us solve problems, build our CI/CD pipeline and lead Mastercard in DevOps automation and best practices. The role of BizOps is to be the production readiness steward for Mastercard products. As a BizOps SRE, we are responsible for ensuring that our platform is stable and healthy. We break down barriers to run our products by fostering developer run ownership and empowering developers to build resilient products. We support our developers during the application build phase in software run principals that includes operational design, automation, capacity planning, monitoring that leads to fault-tolerant, scalable products. We see the big picture and help create and enforce operations standards while facilitating an agile and learning culture. We support daily operations with a hyper focus on triage, root cause by understanding the business impact of our products and subsequently performing blameless post-mortems. The goal of every Business Operations team is to engage early in the development lifecycle to be more proactive and upfront in the development process, and to proactively manage production and change activities to maximize customer experience and increase the overall value of supported applications. Business Operations teams also focus on risk management by tying all our activities together with an overarching responsibility for compliance and risk mitigation across all our environments. Ultimately, the role of Business Operations is to align Product and Customer Focused priorities with Operational needs by providing continuous feedback throughout the lifecycle. Business Operations is leading the DevOps transformation at Mastercard through our tooling and by being an advocate for change & standards throughout the development, quality, release, and product organizations. We need team members with an appetite for change and pushing the boundaries of what can be done with automation.

Requirements

  • BS degree in Computer Science or related technical field involving coding (e.g., physics or mathematics), or equivalent practical experience.
  • Coding and/or scripting exposure.
  • Experience with algorithms, data structures, scripting, pipeline management, and software design.
  • Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.
  • Interest in designing, analysing, and troubleshooting large-scale distributed systems.
  • Willingness and ability to learn and take on challenging opportunities and to work as a member of matrix based diverse and geographically distributed project team.
  • Ability to balance doing things right with fixing things quickly.
  • Comfortable collaborating with cross-functional teams to ensure that expected system behaviour is understood and monitoring exists to detect anomalies.
  • Experience building and maintaining Kafka clusters.
  • Familiarity with Linux and supported scripting (python and bash).

Nice To Haves

  • Kafka Connect experience.
  • Experience building observability tools including but not limited to alerts and dashboards.

Responsibilities

  • Serve as the primary contact responsible for the overall application health, performance, and capacity.
  • Support services before they go live through activities such as system design consulting, capacity planning and launch reviews.
  • Partner with the development and product team of a new application to establish the right monitoring and alerting strategy and create the framework to achieve zero downtime during deployment.
  • Practice sustainable incident response and blameless post-mortems while taking a holistic approach to problem solving and optimizing time to recover.
  • Automate data-driven alerts to proactively escalate issues.
  • Work with development teams to establish SLOs and improve reliability.
  • Engage in and improve the whole lifecycle of services—from inception and design, through deployment, operation, and refinement.
  • Support the application CI/CD pipeline for promoting software into higher environments through validation and operational gating.
  • Increase automation and tooling to reduce toil and manual intervention.
  • Analyze ITSM activities of the platform and provide feedback loop to development teams on operational gaps or resiliency concerns.

Benefits

  • Insurance (including medical, prescription drug, dental, vision, disability, life insurance).
  • Flexible spending account and health savings account.
  • Paid leaves (including 16 weeks of new parent leave and up to 20 days of bereavement leave).
  • 80 hours of Paid Sick and Safe Time.
  • 25 days of vacation time and 5 personal days, pro-rated based on date of hire.
  • 10 annual paid U.S. observed holidays.
  • 401k with a best-in-class company match.
  • Deferred compensation for eligible roles.
  • Fitness reimbursement or on-site fitness facilities.
  • Eligibility for tuition reimbursement.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service