Staff Power Attainment Engineer

Advanced Micro Devices, IncAustin, TX
16hOnsite

About The Position

At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career. About the Team The Data Center GPU Power and Performance Attainment (PPA) Team is a hardware‑focused lab organization responsible for optimizing power, performance, and performance‑per‑watt across AMD’s Data Center GPU products. The team works at the intersection of silicon, systems, firmware, and workloads, driving post‑silicon validation, power feature tuning, and product readiness for large‑scale AI and HPC deployments. The Opportunity As a Power Attainment Engineer, you will play a critical role in ensuring AMD’s Data Center GPUs deliver industry‑leading performance within power and reliability limits. You will work hands‑on in the lab, tuning power management features, electrically stress testing silicon and platform, correlating models to silicon behavior, and debugging complex system‑level interactions. You will collaborate closely with architects, firmware teams, board and platform engineers, performance teams, and customer engineering to bring robust, power‑efficient products to market.

Requirements

  • Strong background in power management, power/performance optimization, and post‑silicon validation for GPU or SoC platforms.
  • Hands‑on experience with system‑level electrical validation and lab instrumentation.
  • Proficiency in Python scripting and automation for scalable validation and analysis.
  • Ability to work effectively across architecture, firmware, hardware, and performance teams.
  • Proven ownership and ability to independently drive complex technical problems to closure.

Nice To Haves

  • 10+ years of experience in the semiconductor industry, with focus on power, performance, or system validation.
  • Solid understanding of GPU architecture, computer organization, and power management techniques.
  • Experience with power limited performance methodologies and basic control theory concepts.
  • Familiarity with HPC and AI workloads and benchmarking.
  • Experience working in data center environments (boards, systems, racks, clusters).
  • Experience with AI tools driving generational efficiency improvements in tuning, analysis & debug.

Responsibilities

  • Execute post silicon power and performance validation on Data Center GPU platforms.
  • Tune and optimize power management features to improve performance per watt.
  • Bring up and electrically characterize ML/AI GPU systems in lab environments.
  • Use oscilloscopes, probes, DAQs, and power measurement tools for detailed electrical analysis.
  • Analyze power and performance data, validate results, and drive feature productization.
  • Debug complex interactions across silicon, firmware, board, and system domains.
  • Develop and maintain automation for workload execution, data processing, and analysis (Linux/Python).
  • Partner with rack and cluster teams on end to end electrical validation of large scale systems.
  • Support customer issue debug by designing targeted experiments and DOEs.
  • Provide technical leadership, mentor junior engineers, and communicate results to stakeholders.

Benefits

  • Competitive compensation, benefits, and global career opportunities.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service