Mechanical and Thermal Reliability Engineer

MatXMountain View, CA
Onsite

About The Position

Matx is developing advanced rack- and POD-level infrastructure systems to support next-generation high-performance computing and AI workloads. Our focus is on delivering highly reliable, thermally efficient, and scalable hardware solutions, including cutting-edge liquid cooling architectures and mechanical systems. We are building vertically integrated platforms that combine mechanical design, thermal management, packaging, and system-level engineering to ensure robust performance under extreme operating conditions. We are looking for engineers who are passionate about reliability engineering, system validation, and cross-functional product development.

Requirements

  • 0–5 years of experience in mechanical and/or thermal reliability engineering in data center hardware, server systems, or high-performance computing platforms
  • Bachelor’s or Master’s degree in Mechanical Engineering, Thermal Engineering, or a related field
  • Strong understanding of Mechanical and thermal reliability principles
  • Strong understanding of Failure analysis and root cause methodologies
  • Strong understanding of Materials and process characterization
  • Experience with Design of Experiments (DOE)
  • Experience with Product validation and qualification processes
  • Ability to work effectively in cross-functional engineering teams

Nice To Haves

  • Knowledge of liquid cooling systems and architectures
  • Knowledge of Advanced packaging and interconnect technologies
  • Familiarity with industry standards such as: ISO, OCP
  • Experience working with ODM/JDM/CM partners in manufacturing environments
  • Strong analytical, problem-solving, and data-driven decision-making skills

Responsibilities

  • Develop and execute reliability strategies and qualification plans for New Product Development (NPD) programs
  • Design and implement Design of Experiments (DOE) for material, process, and product validation
  • Perform reliability testing, evaluation, and qualification for mechanical and thermal systems
  • Lead qualification readiness assessments, including material readiness and test program validation
  • Conduct failure analysis and root cause investigations, and drive corrective and preventive actions (CAPA) during development.
  • Work on system-level reliability for: Mechanical assemblies, High-speed connectors, Compute blade architectures
  • Evaluate and improve the reliability of liquid cooling systems, including: Direct-to-chip (D2C) cooling, Rack and POD-level liquid cooling infrastructure
  • Collaborate with ODM, JDM, and CM partners on L6~L11 system testing and reliability validation
  • Develop reliability reports, risk assessments, and qualification documentation
  • Contribute to product validation processes and drive improvements in product quality and reliability
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service