Failure Analysis Engineer

Meta
Fremont, CA
Onsite

About The Position

Be part of a centralized Product Integrity Team. This is an opportunity to influence the future of Meta’s infrastructure as we build an FA framework from the ground up, one needed to scale our current services and support the long-term goal of achieving Artificial General Intelligence (AGI). The candidate will work on state-of-the-art projects, including custom ASICs and robotics, and will strengthen FA relationships with our worldwide supply and manufacturing base. The candidate will apply knowledge of system hardware, electrical, and materials analysis to perform failure analysis across a range of issues and provide turnkey solutions that arrive at corrective actions.

Requirements

  • Bachelor's degree in Electrical Engineering, Mechanical Engineering, or Materials Science with 6+ years of failure analysis experience
  • Proven proficiency with eFA equipment, including oscilloscopes, spectrum analyzers, frequency generators, source meters, and multimeters
  • Hands-on experience with SEM, DB-FIB, mechanical polishing, dicing, EDS, mechanical probing, EBIC/EBAC, OBIRCH, LADA, TIVA, etc.
  • Experience applying a systematic FA approach: methodically breaking down a system or process into subparts to isolate failures and troubleshoot to the component level

Nice To Haves

  • Master's or PhD in Electrical Engineering or Materials Science with direct Data Center HW FA experience
  • Experience analyzing datacenter failures (rack level hardware and/or datacenter infrastructure hardware)
  • Fault localization skills for PCIe Gen 5/6 and DDR Gen 5/6
  • Fault localization skills for custom Si/ASICs

Responsibilities

  • Own all aspects of identifying and characterizing failures on Data Center HW, leading to successful identification of the root cause of each failure
  • Use FA equipment such as optical/digital microscopes, oscilloscopes, X-ray, and SEM as needed to successfully complete FA projects
  • Work closely with HW Reliability to root-cause reliability failures
  • Recreate and identify failures observed in engineering, production, and field as needed for efficient fault isolation
  • Debug thermal-related PCB and electrical module failures
  • Develop corrective actions for observed failures based on Physics of Failure methodology, and validate those actions
  • Take additional steps to prevent similar issues from recurring on new NPI programs
  • Quantify FA in terms of improvements in yield, reliability, serviceability, manufacturability, testability, capacity, and cost when needed
  • Validate and verify supplier and vendor FA
  • Work with cross-functional teams from all disciplines to determine and/or discuss root cause
  • Create SOPs that enable vendors and suppliers to repeat similar FA if and when needed
  • Maintain lab organization and detailed documentation that lead to effective FA reports

Benefits

  • Bonus
  • Equity
  • Benefits

What This Job Offers

Job Type

Full-time

Career Level

Senior

Number of Employees

5,001-10,000 employees
