Failure Analysis Engineer

Hyve SolutionsFremont, CA
163d$119,000 - $140,000

About The Position

@HYVE Solutions, missions to help customers, business partners, and employees achieve success through shared goals, strategies, resources and technology solutions. Key Responsibilities Customer & Cross-Functional Collaboration Work directly with top-name customers on development and deployment of new server hardware and firmware (BIOS, RAID, DMI, FRU). Develop and maintain excellent working relationships with customers, vendors, and internal teams in support of overall company objectives. Provide support to manufacturing and customer service teams, replicating and troubleshooting field-returned failures. Drive technical issues to closure by working with internal and field engineers to identify, track, root cause, and resolve issues. Failure Analysis & Troubleshooting Perform PCBA-level troubleshooting and component-level FA to identify root causes of hardware failures. Use diagnostic equipment including oscilloscopes, logic analyzers, multimeters, thermal imaging cameras, X-ray, BGA rework stations, and ICT/boundary scan tools. Interpret and analyze schematics, Gerber files, and PCBA layout documentation to trace power rails, signal paths, and interconnects. Troubleshoot power delivery issues, signal integrity problems, and component defects on x86 server systems. Product Validation & Testing Provide input on next-generation server products by designing, building, and testing evaluation units to meet customer requirements. Balance design choices across cost, cooling and power efficiency, form factor, reliability, and specific project requirements. Qualify bleeding-edge hardware and firmware technologies at both component and platform levels. Execute and analyze power and performance benchmark tests on new server hardware. Refine standard tests to drive improvements in new technologies and better meet customer needs. Scripting & Automation Develop Python scripts to automate FA data collection, stress testing, and performance analysis. Configure and administer Linux-based OS (RedHat, SuSE, Debian) for hardware validation and debugging. ABOUT HYVE SOLUTION Hyve Solutions is a leader in the design to worldwide deployment of hyperscale digital infrastructures. In partnership with customers, Hyve leverages deep-seated industry experience and strong vendor partnerships to design and deliver purpose-built server, storage, and networking solutions to meet datacenter demands for today and beyond. Hyve Solutions is a wholly owned subsidiary of TD SYNNEX Corporation (NYSE: SNX). ABOUT TD SYNNEX CORPORATION TD SYNNEX (NYSE: SNX) is a leading global distributor and solutions aggregator for the IT ecosystem. We’re an innovative partner helping more than 150,000 customers in 100+ countries to maximize the value of technology investments, demonstrate business outcomes and unlock growth opportunities. Headquartered in Clearwater, Florida, and Fremont, California, TD SYNNEX’ 23,000 co-workers are dedicated to uniting compelling IT products, services and solutions from 1,500+ best-in-class technology vendors. Our edge-to-cloud portfolio is anchored in some of the highest-growth technology segments including cloud, cybersecurity, big data/analytics, IoT, mobility and everything as a service. TD SYNNEX is committed to serving customers and communities, and we believe we can have a positive impact on our people and our planet, intentionally acting as a respected corporate citizen. We aspire to be a diverse and inclusive employer of choice for talent across the IT ecosystem.

Requirements

  • BS in Mechanical, Electrical, Computer Engineering, Computer Science, or equivalent industry experience (3+ years).
  • Deep knowledge of server component architecture and design (motherboards, CPUs, RAM, hard drives, heat sinks, fans).
  • Working knowledge of power-related components (power supplies, PDUs, VRMs) and control interfaces (PMBus, IPMI).
  • Strong experience in PCBA-level troubleshooting, FA at the component level, and reading schematics & PCBA layout files.
  • Ability to troubleshoot x86-based systems, diagnosing hardware and software issues.
  • Proficiency in Python scripting and Linux OS administration (RedHat, SuSE, Debian).
  • Excellent written and verbal communication skills.
  • Attention to detail, strong process orientation, and organizational skills.
  • Self-starter, highly motivated, and comfortable in a fast-moving, small-team environment.

Nice To Haves

  • Experience with thermal, power, and signal integrity analysis tools.
  • Familiarity with hardware validation automation frameworks.

Responsibilities

  • Work directly with top-name customers on development and deployment of new server hardware and firmware (BIOS, RAID, DMI, FRU).
  • Develop and maintain excellent working relationships with customers, vendors, and internal teams in support of overall company objectives.
  • Provide support to manufacturing and customer service teams, replicating and troubleshooting field-returned failures.
  • Drive technical issues to closure by working with internal and field engineers to identify, track, root cause, and resolve issues.
  • Perform PCBA-level troubleshooting and component-level FA to identify root causes of hardware failures.
  • Use diagnostic equipment including oscilloscopes, logic analyzers, multimeters, thermal imaging cameras, X-ray, BGA rework stations, and ICT/boundary scan tools.
  • Interpret and analyze schematics, Gerber files, and PCBA layout documentation to trace power rails, signal paths, and interconnects.
  • Troubleshoot power delivery issues, signal integrity problems, and component defects on x86 server systems.
  • Provide input on next-generation server products by designing, building, and testing evaluation units to meet customer requirements.
  • Balance design choices across cost, cooling and power efficiency, form factor, reliability, and specific project requirements.
  • Qualify bleeding-edge hardware and firmware technologies at both component and platform levels.
  • Execute and analyze power and performance benchmark tests on new server hardware.
  • Refine standard tests to drive improvements in new technologies and better meet customer needs.
  • Develop Python scripts to automate FA data collection, stress testing, and performance analysis.
  • Configure and administer Linux-based OS (RedHat, SuSE, Debian) for hardware validation and debugging.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Number of Employees

5,001-10,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service