Infrastructure Disaster Recovery Technical Lead

Bright Horizons Children's Centers, Newton, MA
Hybrid

About The Position

The Infrastructure Disaster Recovery Technical Lead is responsible for the design, implementation, and operational excellence of enterprise infrastructure systems with a strong emphasis on Disaster Recovery (DR), Business Continuity, and Azure Cloud platforms. This is a hands-on technical leadership role focused on delivering resilient, scalable, and secure infrastructure across hybrid environments, including both Azure cloud and on-premise datacenters.

Bright Horizons is a leading education and care company that helps employees thrive at work and at home by partnering with employers to offer high-quality child care, elder care, and educational support. Our workplace reflects this commitment, with collaborative environments, meaningful benefits, and a culture that supports both career growth and personal well-being. Whether you’re caring for children or powering the systems and partnerships that make it all possible, at Bright Horizons, you’re the difference.

This is a hybrid role and requires onsite work in our Newton, MA office.

Requirements

  • Bachelor's degree in Engineering, Technology, or a related field.
  • 5+ years of experience in infrastructure engineering, cloud operations, or platform engineering.
  • Experience across both Azure cloud and on-premise infrastructure environments.
  • Experience with high availability architectures, backup and recovery strategies, and hybrid infrastructure design.

Nice To Haves

  • Proven experience designing and implementing Disaster Recovery and Business Continuity solutions.
  • Hands-on experience with automation/scripting (e.g., Ansible, PowerShell, Python).
  • Azure certifications (e.g., Azure Solutions Architect Expert, Azure Administrator).
  • Experience with hybrid cloud platforms and migrations.
  • Experience with VMware or other virtualization technologies.
  • Experience with enterprise storage and backup solutions.
  • Experience with Azure governance and cost optimization.
  • Familiarity with ITIL or modern service management practices.
  • Strong hands-on technical expertise across hybrid environments.
  • Strategic thinking with practical, execution-focused delivery.
  • Advanced troubleshooting and root cause analysis skills.
  • Ability to operate effectively in high-availability, business-critical environments.
  • Strong communication skills across technical and business stakeholders.

Responsibilities

  • Lead architecture, design, and deployment of Azure-based and on-premise infrastructure solutions, including IaaS, PaaS, and hybrid environments.
  • Serve as a senior technical authority for infrastructure resiliency, availability, and recovery strategies.
  • Define and enforce RTO/RPO objectives across critical systems and applications.
  • Drive adoption of Azure-native resiliency services (e.g., Azure Site Recovery, Backup, Availability Zones).
  • Design, implement, and maintain end-to-end Disaster Recovery (DR) solutions across cloud and on-premise platforms.
  • Ensure alignment of DR strategies across Azure workloads and on-premise infrastructure (VMware, physical servers, storage systems).
  • Lead DR testing, failover/failback exercises, and simulation scenarios.
  • Ensure compliance with internal controls, audit requirements, and regulatory standards.
  • Identify infrastructure risks and implement mitigation strategies to ensure business continuity.
  • Oversee and optimize day-to-day infrastructure operations across both cloud and on-premise environments.
  • Maintain high availability of compute platforms (VMs, containers, physical servers) and storage systems (SAN/NAS, Azure Storage).
  • Drive standardization and modernization of on-premise environments into hybrid cloud models.
  • Ensure proactive monitoring, alerting, and incident response for all critical platforms.
  • Implement Infrastructure-as-Code (IaC) and automation using tools such as Ansible and, optionally, Terraform.
  • Develop automated solutions for infrastructure provisioning, patch management, and DR failover orchestration.
  • Promote engineering best practices, including configuration management and version control.

Benefits

  • Medical, dental, and vision insurance
  • Paid vacation, sick, holiday, and parental bonding leave
  • 401(k) retirement plan
  • Long-term and short-term disability insurance
  • Life insurance
  • Money-saving discounts and financial planning tools
  • Tuition assistance and education coaching
  • Caregiving support and resources for the children and adults in your family