About The Position

This role will own the development of a comprehensive Design and Product Quality & Reliability program spanning infrastructure design standards, product qualification, supplier quality expectations, reliability engineering, field performance analytics, and continuous improvement. The ideal candidate combines deep technical expertise in critical infrastructure systems with proven experience building organizations and quality programs in large-scale manufacturing, hyperscale infrastructure, semiconductor, power systems, or mission-critical environments. The Sr. Director will partner closely with Engineering, Design, Construction, Supply Chain, Product Engineering, Operations, and strategic suppliers to ensure OCI infrastructure platforms consistently meet aggressive reliability, availability, and lifecycle performance objectives.

Requirements

  • Deep technical expertise in critical infrastructure systems.
  • Proven experience building organizations and quality programs in large-scale manufacturing, hyperscale infrastructure, semiconductor, power systems, or mission-critical environments.
  • Experience with FMEA, fault tree analysis, accelerated life testing, and design-for-reliability practices.
  • Experience establishing product quality benchmarks and reliability performance targets, including AFR (Annualized Failure Rate), IDR, MTBF, and other key reliability indicators.
  • Experience developing supplier quality management frameworks.
  • Experience driving root cause analysis and corrective action processes for field failures and reliability excursions.
  • Experience developing KPI dashboards and measurement systems.
  • Experience analyzing field performance data, warranty trends, operational incidents, and failure modes.
  • Experience establishing data-driven processes to recommend and implement design, component, or supplier changes.
  • Experience benchmarking performance against hyperscale and industry best practices.
  • Experience partnering with Infrastructure Engineering, Capacity Delivery, Operations, Supply Chain, and Product teams.
  • Experience influencing strategic technology and supplier selection decisions.
  • Experience providing executive-level reporting on reliability performance, risks, and improvement initiatives.

Responsibilities

  • Establish and scale OCI’s Design Quality & Reliability organization for AI data center infrastructure.
  • Develop the strategy, operating model, governance, metrics, and execution roadmap for the function.
  • Build and lead a high-performing multidisciplinary team spanning reliability engineering, supplier quality, and design assurance and validation.
  • Define organizational processes and standards for quality and reliability across the infrastructure lifecycle.
  • Ensure infrastructure designs meet OCI reliability, resiliency, maintainability, and lifecycle performance requirements.
  • Drive design assurance processes that validate design intent against operational requirements and long-term reliability objectives.
  • Lead cross-functional design reviews focused on reliability risk reduction and failure prevention.
  • Establish reliability engineering methodologies including FMEA, fault tree analysis, accelerated life testing, and design-for-reliability practices.
  • Define qualification and acceptance criteria for critical infrastructure products and systems used in OCI data centers.
  • Establish product quality benchmarks and reliability performance targets, including AFR (Annualized Failure Rate), IDR, MTBF, and other key reliability indicators.
  • Develop supplier quality management frameworks and collaborate with strategic suppliers to improve product reliability and manufacturing quality.
  • Drive root cause analysis and corrective action processes for field failures and reliability excursions.
  • Develop KPI dashboards and measurement systems to benchmark design and product reliability performance across the OCI infrastructure portfolio.
  • Analyze field performance data, warranty trends, operational incidents, and failure modes to identify systemic improvement opportunities.
  • Establish data-driven processes to recommend and implement design, component, or supplier changes that improve quality, reliability, and operational efficiency.
  • Benchmark OCI performance against hyperscale and industry best practices.
  • Partner with Infrastructure Engineering, Capacity Delivery, Operations, Supply Chain, and Product teams to ensure reliability objectives are embedded throughout the lifecycle.
  • Influence strategic technology and supplier selection decisions using quality and reliability data.
  • Provide executive-level reporting on reliability performance, risks, and improvement initiatives.

Benefits

  • Flexible medical
  • Life insurance
  • Retirement options
  • Volunteer programs