Hardware Diagnostics Software Development Eng.

Advanced Micro Devices, IncSanta Clara, CA
4h

About The Position

At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career. THE ROLE: AMD is looking for a strategic software engineering lead who is passionate about improving the performance of key applications and benchmarks. You will be a member of a core team of incredibly talented industry specialists and will work with the very latest hardware and software technology. THE PERSON: The ideal candidate should be passionate about software engineering and possess leadership skills to drive sophisticated issues to resolution. Able to communicate effectively and work optimally with different teams across AMD.

Requirements

  • 12+ years of experience in embedded systems, low-level software, or hardware bring-up for complex platforms.
  • Extensive board and system bring-up experience, from first silicon through production and manufacturing ramp.
  • Expert-level proficiency in C/C++ for low-level diagnostics, firmware, and system software.
  • Strong experience using Python for automation, test frameworks, data analysis, and tooling.
  • Deep technical knowledge of: PCIe Gen5/Gen6 High-speed SerDes (100/200Gbps+) I2C, SPI, UART, memory interfaces, and board-level protocols
  • Proven hands-on experience with high-speed debug, including: Signal integrity analysis Link training failures Margining and performance validation
  • Strong understanding of Linux internals, kernel drivers, boot flow, and system configuration.
  • Expert-level comfort with hardware debug tools: JTAG Logic analyzers Protocol analyzers (PCIe / SerDes) Oscilloscopes and lab instrumentation
  • Demonstrated ability to lead technically ambiguous problems and drive them to resolution across multiple teams.
  • Excellent communication skills, able to clearly explain complex hardware/software issues to diverse audiences.

Nice To Haves

  • Prior work on data center platforms such as servers, NICs, switches, accelerators, or infrastructure appliances.
  • Experience acting as a technical lead or architect for bring-up or diagnostics across multiple programs.
  • Knowledge of Go (Golang) for tooling or infrastructure software.
  • Familiarity with: Manufacturing test flows and factory diagnostics High-availability systems and reliability testing Telemetry, observability, and field diagnostics
  • Exposure to hardware security, secure boot, firmware validation, or trust chains.
  • Experience influencing hardware design for testability and debug (DFT/DFD).

Responsibilities

  • Define and lead bring-up and diagnostics strategy for next-generation data center hardware platforms, spanning early silicon through high-volume manufacturing and field deployment.
  • Act as a technical authority for system-level bring-up, driving debug across ASIC, board, firmware, OS, and manufacturing test environments.
  • Architect diagnostic software and frameworks used across lab, factory, and field, ensuring scalability, reusability, and long-term maintainability.
  • Lead early silicon enablement, including power-on sequencing, low-level initialization, strap/config validation, and pre-OS diagnostics.
  • Drive root-cause analysis of complex, cross-layer issues, including: PCIe Gen5/Gen6 enumeration, link training, and stability High-speed SerDes (100/200Gbps+) signal integrity and margining Memory, I2C, SPI, UART, and board-level interfaces
  • Partner closely with silicon design, board design, firmware, validation, manufacturing, and software teams to: Influence hardware and firmware design for debuggability Close issues efficiently from symptom to silicon/board root cause Establish clear bring-up and validation milestones
  • Serve as a technical mentor and multiplier, guiding other engineers on bring-up methodology, debug techniques, and diagnostic best practices.
  • Own diagnostics architecture and roadmap, including design reviews, coding standards, test coverage, and long-term technical direction.
  • Lead cross-functional debug efforts, often coordinating multiple teams during high-pressure bring-up or manufacturing blocking issues.
  • Provide hands-on leadership — actively debugging systems, writing critical code, reviewing schematics, and analyzing waveforms when needed.

Benefits

  • AMD benefits at a glance.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service