Senior Manager, Reliability Engineering

OracleTX, TX
Onsite

About The Position

As Senior Manager – Reliability Engineering, you will lead the teams, methods, and programs responsible for improving the availability, maintainability, and lifecycle performance of mission-critical facilities infrastructure across OCI’s data center portfolio. This role sets the direction for reliability engineering practices across electrical, mechanical, and controls domains, with a strong focus on analytics, predictive maintenance, risk reduction, and standardized reliability methods. You will lead engineers and analysts who partner closely with site operations, design, construction, commissioning, and automation teams to identify reliability risks, improve maintenance strategies, strengthen incident learning, and ensure corrective actions are implemented and sustained. This role translates operational data and engineering analysis into portfolio-level standards, priorities, and decisions that protect uptime and support long-term capacity growth.

Requirements

  • Lead engineers and analysts
  • Partner closely with site operations, design, construction, commissioning, and automation teams
  • Identify reliability risks
  • Improve maintenance strategies
  • Strengthen incident learning
  • Ensure corrective actions are implemented and sustained
  • Translate operational data and engineering analysis into portfolio-level standards, priorities, and decisions

Responsibilities

  • Lead the teams, methods, and programs responsible for improving the availability, maintainability, and lifecycle performance of mission-critical facilities infrastructure across OCI’s data center portfolio.
  • Set the direction for reliability engineering practices across electrical, mechanical, and controls domains, with a strong focus on analytics, predictive maintenance, risk reduction, and standardized reliability methods.
  • Lead engineers and analysts who partner closely with site operations, design, construction, commissioning, and automation teams to identify reliability risks, improve maintenance strategies, strengthen incident learning, and ensure corrective actions are implemented and sustained.
  • Translate operational data and engineering analysis into portfolio-level standards, priorities, and decisions that protect uptime and support long-term capacity growth.

Benefits

  • Flexible medical
  • Life insurance
  • Retirement options
  • Volunteer programs
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service