Lead Site Reliability Engineer (SRE)

BMO
Toronto, ON
Hybrid

About The Position

We are seeking a senior Site Reliability Engineer (SRE) with a strong developer background to join one of the bank’s premium IT teams. This role is designed for an engineer who has built and shipped production software, understands system behavior from the code up, and applies software engineering principles to solve reliability and operational challenges at scale. You will act as a reliability and engineering leader, partnering closely with application development teams to design, build, and operate highly reliable, scalable, and resilient cloud‑based systems. In addition to hands‑on delivery, you will help shape SRE standards, influence architecture decisions, mentor engineers, and contribute to the SRE Community of Practice.

Requirements

  • Bachelor's degree in Computer Science or a relevant discipline
  • 5+ years of hands‑on software development experience using Node.js, Python, or Java
  • Strong background as a software developer
  • Experience with AWS and cloud environments
  • Deep knowledge of SRE principles

Nice To Haves

  • Experience with chaos engineering and resilience testing
  • Strong knowledge of AWS services including ECS, Lambda, and RDS

Responsibilities

Site Reliability Engineering & Delivery

  • Apply software engineering principles to reliability problems, treating reliability, availability, and performance as code‑driven concerns
  • Design, build, and evolve cloud‑based services with a focus on operability, resilience, and simplicity
  • Embed with application development teams as a peer engineer, influencing design decisions and operational readiness
  • Define and improve service reliability targets (SLIs, SLOs, error budgets)

Developer‑Driven Automation & Platform Engineering

  • Design and build automation, tooling, and internal frameworks
  • Promote Infrastructure as Code (IaC) and everything‑as‑code practices
  • Contribute code to CI/CD pipelines and reliability platforms

Observability & Resilience

  • Architect observability solutions (metrics, logs, traces)
  • Lead disaster recovery and resilience testing exercises

Benefits

  • BMO offers health insurance, tuition reimbursement, accident and life insurance, and retirement savings plans.