At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career. The Quality Engineering team is looking for an experienced Failure Analysis Engineer focused on Power and Thermal, with strong expertise in power behavior, thermal analysis, liquid-cooling performance, failure isolation, and rail bring-up. This individual will support customer and factory failure investigations for GPU accelerators, with primary ownership of PCB triage and board-level fault isolation for power- and thermal-related issues. They will review schematics and layouts to develop targeted debug strategies, set up scope measurements and diagnostics, run functional test DOE’s to reproduce and isolate failures, and work closely with design, validation, FW, and manufacturing teams to accelerate root cause analysis and corrective actions. Your contributions will directly impact product quality, reliability, and customer satisfaction. The ideal candidate is a hands-on engineer with a strong hardware foundation and deep experience in power- and thermal-related failure analysis, debug, and board bring-up. They bring a strong analytical mindset and are skilled at triaging complex PCB failures by narrowing issues to the board, component, rail, thermal condition, cooling behavior, or system interaction level. They are comfortable reviewing schematics, setting up scope captures, running diagnostics, and designing functional test DOE’s to reproduce and isolate hard-to-find failures, while working effectively across design, validation, manufacturing, and repair teams. A strong understanding of liquid-cooling fundamentals—including flow rates, heat dissipation, and thermal transfer behavior—is important for this role. Their communication and documentation skills enable clear reporting and collaboration, and their curiosity and persistence help drive timely, high-quality root cause analysis and corrective actions.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior