Site Reliability Engineer

Momentum Financial Services Group
Toronto, ON
CA$110,000 - CA$120,000
Hybrid

About The Position

The Site Reliability Engineer is responsible for ensuring the availability, performance, and resilience of the organization's digital banking and financial services platforms. This role focuses on automating operational processes, defining and maintaining service-level objectives, and engineering systems that can withstand and recover from failure. You will work closely with engineering, DevOps, QA, cybersecurity, and compliance teams to ensure platform reliability meets both technical and regulatory standards, while minimizing risk to production systems through proactive monitoring, incident response, and continuous improvement of the software delivery lifecycle.

Requirements

  • CI/CD tools.
  • Cloud platforms (AWS, Azure).
  • Containers and orchestration (Docker, Kubernetes).
  • Scripting languages (Python, Bash).
  • Infrastructure as Code (Terraform, Ansible).
  • Observability and monitoring tools.
  • Strong cross-functional collaboration and communication across engineering and compliance teams.
  • Rigorous attention to detail with a proactive approach to risk and failure detection.
  • Ability to perform under pressure and respond decisively during incidents and regulatory deadlines.
  • Bachelor's degree in Computer Science, Information Technology, or related field.
  • 3-5 years of experience in Site Reliability Engineering, DevOps, or Platform Engineering within financial services or fintech.
  • Hands-on experience maintaining reliability for real-time transaction systems, mobile banking, or payment gateways.
  • Familiarity with regulatory compliance requirements and their operational implications for production systems.

Responsibilities

  • Define and maintain service-level objectives (SLOs), error budgets, and reliability targets aligned with business goals and compliance deadlines.
  • Oversee the end-to-end service lifecycle, from code integration to production deployment, with a focus on stability and risk reduction.
  • Ensure all changes comply with relevant financial regulations.
  • Conduct reliability risk and blast-radius assessments before production changes.
  • Coordinate go/no-go decisions with engineering, QA, compliance, and operations stakeholders.
  • Own build, test, and deployment pipelines across multiple environments (staging, UAT, production), ensuring changes are safe, repeatable, and observable.
  • Design and maintain automated CI/CD pipelines and enforce version control policies (e.g., Git Flow) to reduce toil and human error.
  • Engineer zero-downtime deployments and low-impact change strategies for high-availability systems.
  • Develop and maintain rollback, failover, and disaster recovery runbooks for production incidents.
  • Collaborate with Information Security and Compliance teams to validate that infrastructure and deployment practices meet data protection and privacy standards.
  • Maintain audit-ready documentation of change activity, incident timelines, and remediation records.
  • Support internal and external audits with detailed operational and change history.
  • Drive automation, standardization, and observability improvements across the production environment.
  • Conduct post-incident reviews (blameless post-mortems) to identify systemic failures and prevent recurrence.
  • Contribute to DevOps and SRE maturity initiatives across engineering teams.
  • Act as the central liaison between product, development, and compliance teams on production health and change risk.
  • Communicate change scope, reliability risks, and incident status clearly to both technical and non-technical stakeholders.
  • Provide regular reliability reporting, SLO performance metrics, and incident trends to senior management.

Benefits

  • Competitive pay aligned with experience and market standards
  • Discretionary Annual Bonus
  • Comprehensive Benefits – Health and dental coverage with premiums fully paid, plus access to an Employee Assistance Program
  • Retirement Plans
  • Hybrid Work Environment
  • Tuition reimbursement
  • Professional development support
  • Discounts through Perkopolis
  • Recognition programs that celebrate your impact