System Design & Debug Manager – AI Customer Engineering

Advanced Micro Devices, IncSanta Clara, CA
Hybrid

About The Position

This role serves as the debug execution backbone of AMD’s AI Customer Engineering organization, driving complex silicon, system, and fleet-level issues to resolution across all major customer segments. The System Design Manager plays a critical role in ensuring customer success, product quality, and large-scale deployment confidence through disciplined, end‑to‑end debug execution. This is a high-visibility, high-impact position requiring deep technical expertise and strong cross-functional program leadership.

Requirements

  • Bachelor’s degree in Electrical Engineering, Computer Engineering, Computer Science, or related field required
  • Deep hands-on experience with silicon debug (pre‑silicon and post‑silicon)
  • Strong background in product development, debug tools, validation, failure analysis, or customer engineering
  • Proven experience managing complex debug programs across multiple customer segments
  • Strong functional team and project management skills with ability to drive execution across global, cross-functional teams
  • Excellent written and verbal communication skills, including executive-level engagement

Nice To Haves

  • Deep understanding of data center system architecture (CPU, GPU, FPGA, memory, connectivity, RAS, hotplug)
  • Familiarity with hardware bring up, validation, manufacturing, and test flows
  • Knowledge of reliability and quality metrics (yield, DPM, FIT)
  • Proven years of experience in the semiconductor industry
  • Master’s degree preferred

Responsibilities

  • Lead debug execution across hyperscale, OEM, HPC, and enterprise customer programs.
  • Own high‑impact, cross‑customer and systemic issues and maintain visibility into top risks and trends.
  • Partner with Customer Program Managers to align debug execution with customer deliverables, platform readiness, and deployment schedules.
  • Support escalations and executive‑level customer engagements.
  • Drive cross‑functional debug efforts across design, validation, product engineering, and failure analysis.
  • Align pre‑ and post‑silicon debug strategies and connect lab debug to real‑world customer environments.
  • Lead resolution of field failures, fleet anomalies, and data center reliability issues.
  • Aggregate fleet, RMA, and production signals and feed learnings back into design, validation, and manufacturing.
  • Own debug tracking, prioritization, risk management, and executive reporting.
  • Apply structured methodologies (8D, CAPA, FMEA) and drive continuous improvement in execution speed and consistency.

Benefits

  • AMD benefits at a glance
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service