Infrastructure Disaster Recovery Technical Lead

Bright Horizons Family SolutionsNewton, MA
$111,000 - $128,000Hybrid

About The Position

The Infrastructure Disaster Recovery Technical Lead is responsible for the design, implementation, and operational excellence of enterprise infrastructure systems with a strong emphasis on Disaster Recovery (DR), Business Continuity, and Azure Cloud platforms. This is a hands-on technical leadership role focused on delivering resilient, scalable, and secure infrastructure across hybrid environments, including both Azure cloud and on-premise datacenters. Bright Horizons is a leading education and care company that helps employees thrive at work and at home by partnering with employers to offer high-quality child care, elder care, and educational support. Our workplace reflects this commitment—with collaborative environments, meaningful benefits, and a culture that supports both career growth and personal well-being. Whether you’re caring for children or powering the systems and partnerships that make it all possible, at Bright Horizons, you’re the difference. This is a hybrid role and requires onsite work in our Newton, MA office.

Requirements

  • Bachelor's Degree in Engineering, Technology or related field
  • 5+ years of experience in infrastructure engineering, cloud operations, or platform engineering
  • Experience across both Azure cloud and on-premise infrastructure environments.
  • Experience with high availability architectures, backup and recovery strategies, and hybrid infrastructure design.

Nice To Haves

  • Proven experience designing and implementing Disaster Recovery and Business Continuity solutions.
  • Hands-on experience with automation/scripting (e.g., Ansible, PowerShell, Python).
  • Azure certifications (e.g., Azure Solutions Architect Expert, Azure Administrator).
  • Experience with hybrid cloud platforms and migrations.
  • Experience with VMware or other virtualization technologies.
  • Experience with enterprise storage and backup solutions.
  • Experience with Azure governance and cost optimization.
  • Familiarity with ITIL or modern service management practices.
  • Strong hands-on technical expertise across hybrid environments.
  • Strategic thinking with practical, execution-focused delivery.
  • Advanced troubleshooting and root cause analysis skills.
  • Ability to operate effectively in high-availability, business-critical environments.
  • Strong communication skills across technical and business stakeholders.

Responsibilities

  • Lead architecture, design, and deployment of Azure-based and on-premise infrastructure solutions, including IaaS, PaaS, and hybrid environments.
  • Serve as a senior technical authority for infrastructure resiliency, availability, and recovery strategies.
  • Define and enforce RTO/RPO objectives across critical systems and applications.
  • Drive adoption of Azure-native resiliency services (e.g., Azure Site Recovery, Backup, Availability Zones).
  • Design, implement, and maintain end-to-end Disaster Recovery (DR) solutions across cloud and on-premise platforms.
  • Ensure alignment of DR strategies across Azure workloads and on-premise infrastructure (VMware, physical servers, storage systems).
  • Lead DR testing, failover/failback exercises, and simulation scenarios.
  • Ensure compliance with internal controls, audit requirements, and regulatory standards.
  • Identify infrastructure risks and implement mitigation strategies to ensure business continuity.
  • Oversee and optimize day-to-day infrastructure operations across both cloud and on-premise environments.
  • Maintain high availability of compute platforms (VMs, containers, physical servers) and storage systems (SAN/NAS, Azure Storage).
  • Drive standardization and modernization of on-premise environments into hybrid cloud models.
  • Ensure proactive monitoring, alerting, and incident response for all critical platforms.
  • Implement Infrastructure-as-Code (IaC) and automation using tools such as Ansible and Terraform (optional).
  • Develop automated solutions for infrastructure provisioning, patch management, and DR failover orchestration.
  • Promote engineering best practices, including configuration management and version control.

Benefits

  • Medical, dental, and vision insurance
  • Paid vacation, sick, holiday, and parental bonding leave
  • 401(k) retirement plan
  • Long-term and short-term disability insurance
  • Life insurance
  • Money-saving discounts and financial planning tools
  • Tuition assistance and education coaching
  • Caregiving support and resources for the children and adults in your family
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service