Manager, DevOps Engineering

Teladoc HealthNew York City, NY
Remote

About The Position

Teladoc Health Inc. seeks Manager, DevOps Engineering (Multiple Openings) at its facility located at 155 E 44th Street, 17th Floor, New York, NY 10017. This role involves working with engineering development leadership to build shared cloud infrastructure and application services that meet the requirements and needs of the commercial platform and application teams, ensuring services are designed with 24/7 availability and operational maturity. The position requires leading, managing, and inspiring a team of DevOps engineers, building and maintaining a highly effective release management system, and assigning and monitoring the work of technical personnel. The role includes implementing quality control and review systems, conducting periodic reviews with stakeholders, and identifying/executing preventive measures to minimize customer impact. Analysis of application performance against Technical Operations SLA, collaboration with cross-functional teams, and implementation of proactive monitoring, alerting, and self-healing systems are key responsibilities. The role also involves initiating and driving service improvement plans, managing the analysis and approval of new code through security and performance gates, and advocating for security and performance standards. Operational aspects of production and development servers, including developing and validating compliance with procedures, are managed. Adherence to ITIL and other audited regulatory compliance programs is required. The position involves updating management on critical incidents, acting as a point of contact, and working with various teams and vendors to resolve issues and provide expertise for enhancements. Maintaining high-quality system technical specifications, process procedures, runbooks, SOPs, and infrastructure requirements definitions is essential. The development of SaaS public and private cloud infrastructure must be cost-conscious, utilizing "infrastructure as code" and automated provisioning tools. Participation in the full software and infrastructure development life cycle, including requirements analysis and design implementation, is expected. Consultation with Product Management for prototyping, refining, testing, and shipping products, as well as participating in the implementation of new customer features, products, and utilities, is required. Identifying and evaluating new technologies and analyzing user needs to determine technical requirements are also part of the role. Collaboration with Operational, Development, and Architectural teams to ensure operational maturity requirements (reliability, availability, scalability, observability, performance, capacity etc.) are met, and recommending operational improvements are crucial. Ensuring internally developed and externally acquired solutions are appropriately instrumented and monitored according to Teladoc standards is necessary. Managerial work to accomplish tasks and projects within defined timelines and in a professional manner is expected. This role is 100% Telecommuting and supervises 9 individuals.

Requirements

  • Version management and ticketing systems such as Git and Jira
  • Deploying, operating and troubleshooting web application software on Unix/Linux systems
  • Continuous integration, testing and deployment with tools such as Jenkins, Azure DevOps, Bamboo CI, etc.
  • Working with large scale infrastructure in AWS or Azure public cloud
  • Designing systems with high availability and disaster recovery
  • Monitoring, metrics and visualization tools such as New Relic, Sensu, Nagios, etc.
  • Unix/Linux system administration and troubleshooting

Responsibilities

  • Work with engineering development leadership to build shared cloud infrastructure and application services that meet the requirements and needs of the commercial platform and application teams.
  • Ensure services are designed with 24/7 availability and operational maturity.
  • Lead, manage and inspire a team of DevOps engineers.
  • Build and maintain a highly effective release management system.
  • Assign and monitor the work of technical personnel, ensuring that application infrastructure development and deployment is done in the best possible way, and implement quality control and review systems throughout the development and deployment processes.
  • Conduct periodic reviews with key stakeholders.
  • Identify, evaluate, and execute preventive measures to minimize/avoid impact to the customer's experience.
  • Analyze and review application performance against the Technical Operations SLA.
  • Work with cross-functional business teams to understand requirements and other performance SLAs.
  • Collaborate with Product and Customer Support teams to plan and deploy product releases.
  • Implement proactive monitoring, alerting, trend analysis and self-healing systems.
  • Initiate and drive service improvement plans, collaborating with our SRE and NOC teams.
  • Manage analysis and approval of new code through security and performance gates that the position will design and develop for feature-complete software.
  • Be an advocate for security and performance standards in the organization.
  • Manage operational aspect of production and development servers including developing, training in, and validating compliance with procedures and checklists related to disk space usage, monitoring solutions, deployment, conventions, access to the production and development sources, source control access and usage, performance monitoring, code modifications validation, scheduling, and more.
  • Be responsible for the team's adherence to ITIL and other audited regulatory compliance (HiTrust, FedRAMP) programs.
  • Update management in case of critical incidents and act as a point of contact for other related communications.
  • Work within IT and Technical Operations, business stakeholders and with vendors, to successfully identify, prioritize, and resolve issues and provide subject matter expertise for enhancements, developments, operational improvements to the website applications that Teladoc Health relies on.
  • Ensure system technical specifications, process, procedures, runbooks, SOPs, and infrastructure requirements definitions based on conceptual design are maintained and high-quality for all technical and non-technical deliverables to a wider organizational audience and for T3-4 level support of production workloads.
  • Ensure the development of our SaaS public and private cloud infrastructure is conducted in cost-conscious ways, using "infrastructure as code" and automated environment provisioning tools and techniques.
  • Participate in the full software and infrastructure development life cycle process including requirements analysis and design implementation.
  • Consult with the Product Management team to prototype, refine, test, and ship products to meet business needs.
  • Consult and participate in implementation of new customer features, products, and utilities.
  • Identify and evaluate new technologies for implementation.
  • Analyze user needs to determine technical requirements.
  • Collaborate with the Operational, Development and Architectural teams to ensure operational maturity requirements (reliability, availability, scalability, observability, performance, capacity etc.) are met, and recommend operational improvements to them.
  • Ensure internally developed and externally acquired custom off the shelf solutions are appropriately instrumented and monitored via the Teladoc monitoring standards.
  • Perform managerial work to accomplish tasks and projects within defined timelines and in a professional manner in alignment with active standards.

Benefits

  • Flexible Vacation Policy
  • 80 hours of Paid Sick, Safe, and Caregiver Leave annually
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service