Director, Site Reliability

Early Warning®Scottsdale, AZ
Hybrid

About The Position

The Director, Site Reliability is a highly impactful role responsible for ensuring the reliability of EWS applications by managing the EWS Platform Site Reliability team. The SRE team supports our first responders in the tools and information they use, while driving mitigations for all high priority incidents and owning the RCA program to prevent incidents from recurring. Engineering at Early Warning (EWS) is a blend of teams organized around many different platforms, capabilities and products that are brought together to power core capabilities at the biggest banks in America – this includes ubiquitous products like Zelle® and Paze. These capabilities are typically provided behind a customer-facing API or integration point which enables the EWS teams to innovate aggressively where big wins can be found. The teams aligned behind these efforts drive their own innovation in partnership with stakeholders. If you are hungry for large scale challenges and crave opportunities to learn and contribute in a big way – we’d love to talk to you!

Requirements

  • Bachelor’s Degree in Computer Science or related field.
  • 10 or more years related experience with at least 7 years of management experience supervising systems support or banking operations personnel with knowledge of business production systems coupled with the ability to apply this towards business operations.
  • Demonstrated success developing and retaining highly engaged, high performing teams and aligning talent to meet business needs.
  • Demonstrated experience establishing and maintaining Site Reliability principals for a modern organization
  • Demonstrated experience establishing and leading 24X7 on-call teams in engineering
  • Experience with industry standard tools such as GitLab, JIRA, Terraform, Vault, Grafana, PagerDuty, AppDynamics etc.
  • Background and drug screen
  • Candidates responding to this posting must independently possess the eligibility to work in the United States, for any employer, at the date of hire.
  • This position is ineligible for employment Visa sponsorship.

Nice To Haves

  • Additional related education and/or experience preferred.

Responsibilities

  • Leads a high performing team of Site Reliability Engineers that provides 24/7 response to critical incidents.
  • Develops, manages, and owns tools that provide observability, monitoring and alerting for all EWS product applications.
  • Works closely with product and infrastructure development teams to ensure our applications are instrumented and measured.
  • Responsible for implementing all changes through pipelines and code.
  • Identify, evangelize and implement reliability patterns for EWS product applications.
  • Owns and improves the Incident Management Program, Policy, and Procedures for EWS, including both handling active incidents and the Root Cause Analyses process.
  • Owns the Change Management function at EWS ensuring changes to critical environments have risk appropriately identified and mitigated.
  • Supports the company’s commitment to risk management and protecting the integrity and confidentiality of systems and data.

Benefits

  • Healthcare Coverage – Competitive medical (PPO/HDHP), dental, and vision plans as well as company contributions to your Health Savings Account (HSA) or pre-tax savings through flexible spending accounts (FSA) for commuting, health & dependent care expenses.
  • 401(k) Retirement Plan – Featuring a 100% Company Safe Harbor Match on your first 6% deferral immediately upon eligibility.
  • Paid Time Off – Flexible Time Off for Exempt (salaried) employees, as well as generous PTO for Non-Exempt (hourly) employees, plus 11 paid company holidays and a paid volunteer day.
  • 12 weeks of Paid Parental Leave
  • Maven Family Planning – provides support through your Parenting journey including egg freezing, fertility, adoption, surrogacy, pregnancy, postpartum, early pediatrics, and returning to work.
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service