Senior Manager, Technology Scenario Testing

ScotiabankToronto, ON
Onsite

About The Position

As the Senior Manager Technology Scenario Testing, you contribute to the global success of the Resilience Engineering function by designing, facilitating, assessing, and governing Technology Scenario Tests in order to continuously improve the bank’s resilience capabilities. In this role you will deliver on the requirements of the Technology Scenario Testing Program and help to drive improvements in automation, testing, and observability capabilities that enhance the stability and reliability of technology services. In this role, you will: Champion a customer focused, engineering driven culture that strengthens resilience across the bank’s global technology organization. Provide thought leadership on resilience engineering practices, including risk and hazard analysis, scenario design, scenario facilitation techniques, and post-exercise analysis. Work to build consensus across the organization and foster a culture of resilience and reliability Work with teams across the organization to identify gaps build strong resilience capabilities Engage with stakeholders in both the technology and business communities in order to design and coordinate effective technology scenario tests Design and manage Key Performance Indicators (KPIs) and Key Risk Indicators (KRIs) as they relate to Technology Scenario Testing and engage stakeholders to communicate performance Perform qualitative analysis of past scenario tests and pervious technology incidents and work with stakeholders to update relevant scenario libraries to improve testing effectiveness Clearly articulate complex engineering concepts and resilience risks to both technical and non-technical audiences, enabling informed decision making. Lead and participate in resilience engineering forums, communities of practice, and cross-functional working groups. Use analytical tooling (e.g., SQL, Python, data visualization platforms, monitoring and observability systems) to derive trends, correlations, and identify focus areas for testing exercises. Develop, administer and review resilience assessments for technical services within the bank Review and analyze previous incident reports and resilience assessments and develop severe but plausible scenarios for testing Work with service owners and business stakeholders to identify scope, constraints, and evaluation criteria for technology scenario testing exercises Act as primary facilitator for Technology Scenario Tests Lead the analysis of completed tests and act as lead author of the final testing report, identifying risks and creating action items Champion a culture of insight driven engineering, ensuring teams use data effectively to inform decisions, drive proactivity, and validate resilience controls

Requirements

  • 7–10+ years of experience in technology engineering roles such as site reliability engineering (SRE), platform engineering, solution architecture, system design, production engineering, or incident response.
  • Experience in performing quantitative and qualitative analysis of incident data in order to find patterns and identify risks
  • Proficiency with analytics tooling (SQL, Python, Jupyter, PowerBI)
  • Experience in, or exposure to, Process Hazard Analysis techniques (HAZOP, What-If, FMEA, FTA, etc) and System Hazards Analysis techniques (STPA, FRAM, etc)
  • Familiarity with ITSM processes and tooling (ServiceNow experience is a plus)
  • Ability to convert complex datasets into engineering actions, design improvements, and strategic recommendations
  • Strong leadership presence with the ability to influence at senior levels across technology and risk stakeholders
  • Excellent communication skills—able to translate complex engineering findings into clear narratives and actionable recommendations
  • Strong organizational and prioritization skills, able to manage multiple workstreams concurrently
  • Demonstrated collaboration across enterprise functions, including architecture, security, operational resilience, development, and infrastructure teams
  • Strong understanding of distributed systems, cloud architectures, networking fundamentals, and application patterns that impact availability and resilience
  • Hands on experience with resilience engineering practices such as: failure mode and dependency analysis, automated failover/failure testing, observability, metrics, logging, distributed tracing, performance and capacity engineering, chaos engineering
  • Experience deploying and operating software on major cloud platforms (GCP, Azure) and container/orchestration ecosystems (Kubernetes)

Nice To Haves

  • Professional certifications (e.g., cloud architecture, SRE, ITIL, resilience/continuity) beneficial but not required

Responsibilities

  • Champion a customer focused, engineering driven culture that strengthens resilience across the bank’s global technology organization.
  • Provide thought leadership on resilience engineering practices, including risk and hazard analysis, scenario design, scenario facilitation techniques, and post-exercise analysis.
  • Work to build consensus across the organization and foster a culture of resilience and reliability
  • Work with teams across the organization to identify gaps build strong resilience capabilities
  • Engage with stakeholders in both the technology and business communities in order to design and coordinate effective technology scenario tests
  • Design and manage Key Performance Indicators (KPIs) and Key Risk Indicators (KRIs) as they relate to Technology Scenario Testing and engage stakeholders to communicate performance
  • Perform qualitative analysis of past scenario tests and pervious technology incidents and work with stakeholders to update relevant scenario libraries to improve testing effectiveness
  • Clearly articulate complex engineering concepts and resilience risks to both technical and non-technical audiences, enabling informed decision making.
  • Lead and participate in resilience engineering forums, communities of practice, and cross-functional working groups.
  • Use analytical tooling (e.g., SQL, Python, data visualization platforms, monitoring and observability systems) to derive trends, correlations, and identify focus areas for testing exercises.
  • Develop, administer and review resilience assessments for technical services within the bank
  • Review and analyze previous incident reports and resilience assessments and develop severe but plausible scenarios for testing
  • Work with service owners and business stakeholders to identify scope, constraints, and evaluation criteria for technology scenario testing exercises
  • Act as primary facilitator for Technology Scenario Tests
  • Lead the analysis of completed tests and act as lead author of the final testing report, identifying risks and creating action items
  • Champion a culture of insight driven engineering, ensuring teams use data effectively to inform decisions, drive proactivity, and validate resilience controls

Benefits

  • Upskilling through online courses, cross-functional development opportunities, and tuition assistance.
  • Competitive Rewards program including bonus, flexible vacation, personal, sick days and benefits will start on day one.
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service