Failure Analysis Engineer

Hyve SolutionsOlive Branch, MS
14d$119,000 - $140,000

About The Position

@HYVE Solutions, missions to help customers, business partners, and employees achieve success through shared goals, strategies, resources and technology solutions. Job Description Key Responsibilities Customer & Cross-Functional Collaboration Work directly with top-name customers on development and deployment of new server hardware and firmware (BIOS, RAID, DMI, FRU). Develop and maintain excellent working relationships with customers, vendors, and internal teams in support of overall company objectives. Provide support to manufacturing and customer service teams , replicating and troubleshooting field-returned failures. Drive technical issues to closure by working with internal and field engineers to identify, track, root cause, and resolve issues. Failure Analysis & Troubleshooting Perform PCBA-level troubleshooting and component-level FA to identify root causes of hardware failures. Use diagnostic equipment including oscilloscopes, logic analyzers, multimeters, thermal imaging cameras, X-ray, BGA rework stations, and ICT/boundary scan tools . Interpret and analyze schematics, Gerber files, and PCBA layout documentation to trace power rails, signal paths, and interconnects. Troubleshoot power delivery issues, signal integrity problems, and component defects on x86 server systems. Product Validation & Testing Provide input on next-generation server products by designing, building, and testing evaluation units to meet customer requirements. Balance design choices across cost, cooling and power efficiency, form factor, reliability, and specific project requirements . Qualify bleeding-edge hardware and firmware technologies at both component and platform levels. Execute and analyze power and performance benchmark tests on new server hardware. Refine standard tests to drive improvements in new technologies and better meet customer needs. Scripting & Automation Develop Python scripts to automate FA data collection, stress testing, and performance analysis. Configure and administer Linux-based OS (RedHat, SuSE, Debian) for hardware validation and debugging.

Requirements

  • BS in Mechanical, Electrical, Computer Engineering, Computer Science, or equivalent industry experience (3+ years).
  • Deep knowledge of server component architecture and design (motherboards, CPUs, RAM, hard drives, heat sinks, fans).
  • Working knowledge of power-related components (power supplies, PDUs, VRMs) and control interfaces ( PMBus, IPMI ).
  • Strong experience in PCBA-level troubleshooting, FA at the component level, and reading schematics & PCBA layout files .
  • Ability to troubleshoot x86-based systems , diagnosing hardware and software issues.
  • Proficiency in Python scripting and Linux OS administration (RedHat, SuSE, Debian).
  • Excellent written and verbal communication skills .
  • Attention to detail, strong process orientation, and organizational skills.
  • Self-starter, highly motivated, and comfortable in a fast-moving, small-team environment .

Nice To Haves

  • Experience with thermal, power, and signal integrity analysis tools .
  • Familiarity with hardware validation automation frameworks .

Responsibilities

  • Work directly with top-name customers on development and deployment of new server hardware and firmware (BIOS, RAID, DMI, FRU).
  • Develop and maintain excellent working relationships with customers, vendors, and internal teams in support of overall company objectives.
  • Provide support to manufacturing and customer service teams , replicating and troubleshooting field-returned failures.
  • Drive technical issues to closure by working with internal and field engineers to identify, track, root cause, and resolve issues.
  • Perform PCBA-level troubleshooting and component-level FA to identify root causes of hardware failures.
  • Use diagnostic equipment including oscilloscopes, logic analyzers, multimeters, thermal imaging cameras, X-ray, BGA rework stations, and ICT/boundary scan tools .
  • Interpret and analyze schematics, Gerber files, and PCBA layout documentation to trace power rails, signal paths, and interconnects.
  • Troubleshoot power delivery issues, signal integrity problems, and component defects on x86 server systems.
  • Provide input on next-generation server products by designing, building, and testing evaluation units to meet customer requirements.
  • Balance design choices across cost, cooling and power efficiency, form factor, reliability, and specific project requirements .
  • Qualify bleeding-edge hardware and firmware technologies at both component and platform levels.
  • Execute and analyze power and performance benchmark tests on new server hardware.
  • Refine standard tests to drive improvements in new technologies and better meet customer needs.
  • Develop Python scripts to automate FA data collection, stress testing, and performance analysis.
  • Configure and administer Linux-based OS (RedHat, SuSE, Debian) for hardware validation and debugging.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Number of Employees

5,001-10,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service