Director, Reliability Engineering, NA

Vantage Data Centers
Port Washington, WI
Onsite

About The Position

About Vantage Data Centers

Vantage Data Centers powers, cools, protects and connects the technology of the world’s well-known hyperscalers, cloud providers and large enterprises. Developing and operating across North America, EMEA and Asia Pacific, Vantage has evolved data center design in innovative ways to deliver dramatic gains in reliability, efficiency and sustainability in flexible environments that can scale as quickly as the market demands.

Reliability Engineering Department

The Reliability Engineering Team is responsible for the overall operating health of critical systems across Vantage’s global facilities. For each of the major systems (Electrical, Mechanical, and Controls), the team is responsible for ensuring success in the commissioning stages of new construction, evaluating and improving the reliability and performance of existing critical infrastructure, sustaining equipment operational availability through maintenance program design, providing ongoing technical support to the Site Operations Teams, and providing systems reliability and maintainability feedback to the Design Engineering teams for future design consideration.

Position Overview

The Director of Reliability Engineering is a key technical leader responsible for supporting the development and execution of reliability strategies across Vantage’s data center operations. This role focuses on ensuring high system uptime, operational performance, and the implementation of reliability best practices. The Director will lead a team of engineers, work closely with cross-functional teams, and contribute to strategic initiatives that enhance the reliability and efficiency of our facilities. This onsite role will be based at our new Port Washington, WI data center campus.

Requirements

  • Bachelor’s degree in Mechanical Engineering, Electrical Engineering, or a related field required.
  • 8+ years of experience in reliability engineering, maintenance, or operations within mission-critical environments such as data centers, utilities, or industrial facilities.
  • Experience managing engineering teams and supporting cross-functional initiatives.
  • Solid understanding of reliability engineering principles, tools (e.g., FMEA, RCA), and maintenance strategies.
  • Familiarity with data analysis tools and monitoring systems.
  • Strong problem-solving and analytical skills, with the ability to translate data into actionable insights.
  • Effective communication skills, with the ability to collaborate across technical and non-technical teams.
  • Strong organizational and time management skills, with the ability to manage multiple priorities.
  • Travel is expected to be up to 5% but may increase over time as the business evolves.

Nice To Haves

  • Master’s degree in Engineering Management, Business Administration, or a related field, preferred.

Responsibilities

  • Support the execution of reliability engineering strategies that align with Vantage’s operational goals and customer expectations.
  • Manage and develop a team of reliability engineers, fostering a collaborative and high-performance work environment.
  • Collaborate with senior leaders and cross-functional teams to help define and implement reliability-focused initiatives.
  • Drive the application of reliability engineering principles across data center operations to support consistent system performance and uptime.
  • Oversee the execution of reliability programs, including preventive maintenance, root cause analysis (RCA), and failure mode and effects analysis (FMEA).
  • Ensure compliance with applicable industry standards and internal reliability protocols.
  • Partner with Engineering, Construction, Operations, and IT teams to integrate reliability considerations into infrastructure design and operations.
  • Coordinate with vendors and external partners to evaluate and implement reliability-enhancing technologies and practices.
  • Use monitoring tools and data analytics to assess system performance, identify trends, and support operational decision-making.
  • Prepare and share reports on reliability metrics and KPIs with internal stakeholders to support transparency and continuous improvement.
  • Identify and recommend process improvements to enhance system reliability and reduce downtime.
  • Contribute to the development and standardization of reliability engineering practices across multiple sites.
  • Participate in risk mitigation planning and support business continuity efforts through reliability-focused assessments.
  • Conduct periodic system reviews and audits to identify areas for improvement and ensure operational resilience.
  • Handle additional duties as assigned by Management.

Benefits

  • This position is eligible for company benefits including but not limited to medical, dental, and vision coverage; life and AD&D insurance; short- and long-term disability coverage; paid time off; employee assistance; participation in a 401(k) program that includes a company match; and many other voluntary benefits.