Technology Resilience Manager

OptimumTown of Oyster Bay, NY
5d

About The Position

The Technology Resilience Manager leads the Technology Resilience Program (TRP) across the CPTO organization, ensuring that critical technology platforms, products, and services are designed, operated, and maintained to withstand and rapidly recover from disruptions. This role is responsible for defining and driving resilience strategies that align technology architecture and operations with business continuity, disaster recovery, operational risk, and regulatory objectives. Working closely with all key Technology domain leaders, (product, engineering, cloud, infrastructure, and security teams), the Manager, Technology Resilience embeds resilience principles into system design and day-to-day operations, strengthening the organization’s ability to anticipate, absorb, and recover from technology and cyber incidents.

Requirements

  • Bachelor’s degree in Information Technology, Engineering, Computer Science, or a related field (or equivalent experience)
  • 7–10+ years of experience in technology operations, engineering, reliability, resilience, disaster recovery, or operational risk roles
  • Strong understanding of modern technology environments, including cloud platforms, distributed systems, and large-scale infrastructure
  • Proven experience leading cross-functional initiatives and influencing technology teams without direct authority

Nice To Haves

  • Experience within a telecommunications, critical infrastructure, or highly regulated environment
  • Certifications such as ISO 22301, ITIL, SRE, CISSP, CRISC, or cloud architecture certifications
  • Experience supporting regulatory reviews, audits, or enterprise risk programs

Responsibilities

  • Lead and mature the Technology Resilience Program (TRP) across the CPTO organization, establishing clear objectives, standards, and success metrics.
  • Define enterprise technology resilience strategies aligned with business continuity, disaster recovery, operational risk management, and service availability goals.
  • Promote a culture of resilience by integrating resilience thinking into technology planning, delivery, and operations.
  • Partner with product, engineering, cloud, and infrastructure teams to embed resilience principles into system architecture, design patterns, deployment models, and operational processes.
  • Provide guidance on redundancy, fault tolerance, failover, backup, and recovery strategies for critical applications and platforms.
  • Influence technology roadmaps to address resilience gaps and reduce systemic risk.
  • Coordinate technology resilience risk assessments and impact analyses to identify single points of failure, dependency risks, concentration risks, and recovery gaps across the technology stack.
  • Assess the potential business and operational impact of technology and cyber disruptions, translating findings into prioritized remediation actions.
  • Maintain a centralized view of technology resilience risks and dependencies.
  • Work closely with Technology leaders in key domains to develop, maintain, and validate recovery procedures for critical applications, platforms, and infrastructure components.
  • Ensure recovery strategies meet defined recovery time objectives (RTO) and recovery point objectives (RPO).
  • Drive regular resilience exercises and scenario testing, including simulations of system outages, cyberattacks, cloud failures, and cascading technology incidents.Implement resilience governance frameworks, standards, and reporting mechanisms to track program maturity, risk exposure, and remediation progress.
  • Support leadership decision-making by framing risks, trade-offs, and investment priorities.
  • Monitor the evolving technology and cyber threat landscape, including emerging risks related to cloud, software dependencies, and third-party platforms.
  • Ensure resilience strategies and plans are proactively updated based on incidents, near-misses, testing outcomes, and external threat intelligence.
  • Drive continuous improvement through lessons learned and post-incident reviews.
  • Ensure alignment with enterprise-wide resilience, risk management, and compliance frameworks.
  • Contribute to audit readiness and regulatory obligations related to technology resilience, operational risk, and service continuity.
  • Partner with risk, compliance, and internal audit teams to address findings and strengthen controls.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service