Senior Engineer, Cyber Resilience

Stride, Inc.
7d$81,046 - $155,000Remote

About The Position

As a Cyber Resilience Senior Engineer, you will be responsible for strengthening the organization’s ability to anticipate, withstand, recover from, and adapt to cyber incidents and other disruptive events. Your primary focus will extend beyond traditional disaster recovery to include designing and implementing resilience strategies that maintain the continuity, integrity, and availability of critical IT services under adverse conditions. You will develop and enhance resilience capabilities such as redundancy, automated failover, immutable backups, cyber‑recovery architectures, and continuity-by-design patterns. You will also lead exercises and simulations to validate resilience posture, identify weaknesses, and drive continuous improvement across systems and processes. Collaboration is central to this role. You will work closely with architects, system engineers, security teams, and operational leaders to identify critical business services, understand their dependencies, assess resilience gaps, and design strategies that reduce the impact of disruptions. You will also partner with business stakeholders to ensure resilience objectives align with organizational priorities and regulatory expectations. To be successful, you should have deep experience in cyber resilience and continuity engineering, including modern cloud‑native resilience patterns, backup and recovery solutions, replication and failover strategies, zero‑trust‑aligned recovery approaches, and containerized or distributed service architectures. Strong project leadership skills are essential, as you will coordinate resilience initiatives across multiple teams and environments. Additionally, you should have excellent communication and stakeholder‑management skills, as you will be responsible for guiding teams through resilience planning, readiness assessments, and incident response activities. A solid understanding of relevant frameworks—such as NIST CSF, NIST 800‑34, ISO 27031, ISO 22301, COBIT, and ITIL—is important to ensure alignment with industry standards and regulatory requirements.

Requirements

  • 8+ years’ experience supporting or performing a Business Continuity Management or IT Disaster Recovery role.
  • Bachelor’s degree and/or the equivalent combination of education and experience.
  • Understanding of Cloud infrastructure, database, and application development and design.
  • Independent, action-oriented and engagement focused on identifying ways to improve resiliency.
  • Functional knowledge of frameworks such as NIST, ISO 27031 & ISO 22301, COBIT, and ITIL.
  • Experience working with SRE, DiRT, and Chaos Engineering practices.
  • Thorough knowledge and understanding of business continuity and disaster recovery planning techniques, technologies and best practices, methods used in performing risk analysis and business impact analyses.
  • Strong familiarity with AWS services relevant to DR/HA and resilient architectures, including AWS Config, CloudFormation, Load Balancers, Autoscaling, AWS Resilience Hub, AWS Elastic Disaster Recovery.
  • Experience working with enterprise Risk Management solutions (Such as ServiceNow, Archer, Resolver, etc.)
  • Ability to travel 10% of the time.
  • Pass required background check.
  • Clear written and verbal communication skills.
  • Ability to work independently and without direct supervision.

Nice To Haves

  • Domain Knowledge of Chaos Engineering / Fault Injection and Disaster Recovery best practices.
  • Skilled on Compute and Storage topology, design, and administration in a Microsoft/Unix/Linux environment.
  • Understanding of AWS Regions and Availability Zone concepts, including relationship of various AWS services (EC2, S3, IAM, RDS, etc.).
  • Experience working in an enterprise IT environment (on prem & cloud), evaluating IT system resiliency via recovery plans inclusive of logical and physical (Visio) diagrams.
  • Possesses strong analytical skills to effectively influence recommendations and decision-making, assess impacts, compare solutions, problem solve, and achieve business and/or technical objectives.
  • Ability to act as a change agent, leads and welcomes innovative ideas and drives continuous improvement and service optimization.
  • Information Technology Infrastructure Library (ITIL)
  • AWS Certified Cloud Practitioner
  • AWS Certified Solutions Architect
  • Experience working with project management tools that support Agile environments (Jira/Confluence).

Responsibilities

  • Lead cyber resilience and IT risk assessments, including tabletop exercises and continuity simulations.
  • Coordinate and oversee resilience and recovery testing, ensuring plans are followed, issues are logged, and results are communicated to stakeholders.
  • Support enterprise preparedness and response efforts for cyber incidents and operational disruptions impacting Stride.
  • Partner with leadership and technical teams to identify resilience gaps, validate requirements, and design solutions that meet or exceed RTO/RPO targets.
  • Document resilience and recovery processes across Stride's technology environment to ensure alignment with business, client, and audit expectations.
  • Contribute to the development and execution of Stride's Resiliency & Chaos Engineering strategy, including processes, tooling, and testing frameworks.
  • Ensure resilience capabilities - redundancy, failover, backups, and cyber-recovery solutions - are properly designed, maintained, and validated.
  • Maintain Disaster Recovery and Cyber Resilience Plans, report readiness to leadership, and track remediation efforts to closure.
  • Provide expert guidance and coordinate cross-functional teams in developing, documenting and validating recovery and resilience procedures.

Benefits

  • Stride offers a robust benefits package for eligible employees that can include health benefits, retirement contributions, and paid time off.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service