Data Center Systems Engineer

Advanced Micro Devices, Inc.
Austin, TX
Onsite

About The Position

At AMD, our mission is to build great products that accelerate next-generation computing experiences, from AI and data centers to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges, striving for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.

The Data Center Platform Engineering Group (DPEG) is seeking an on-site Systems Design Engineer to support complex data center deployments and platform validation efforts. This role sits at the intersection of hardware, firmware, software, and system-level debugging, working hands-on with cutting-edge platforms from initial bring-up through full deployment and validation.

In this position, you’ll work closely with validation engineers, platform architects, firmware teams, and operations partners to stand up complete systems: installing hardware, configuring firmware and operating systems, setting up workloads and tools, and ensuring all layers function together as designed. As systems scale and evolve, you’ll play a key role in debugging, root-cause analysis, and driving issues to resolution, directly impacting the reliability and performance of next-generation data center platforms.

This is a highly collaborative, fast-paced lab environment where you’ll gain deep exposure to platform architecture, system-level debug methodologies, and cross-functional problem solving while supporting critical business priorities.

The Person

The ideal candidate is hands-on, curious, and thrives in technically complex environments. You enjoy solving ambiguous problems, digging into system-level issues, and working closely with others to move challenges forward. You communicate clearly, especially when documenting issues or explaining technical status, and are comfortable taking direction while also identifying opportunities to improve processes and workflows. You’re adaptable, organized, and motivated by seeing systems come together end-to-end, from bare hardware to fully validated platforms running real workloads.

Requirements

  • System-level debugging and root-cause analysis
  • Background in hardware design, platform validation, or verification
  • Linux experience with the ability to collect metrics and troubleshoot issues
  • Familiarity with firmware and debug environments (e.g., BMC, BIOS)
  • Experience using ticketing systems and basic scripting for automation or analysis
  • Strong collaboration and communication skills with internal and external stakeholders

Nice To Haves

  • Exposure to test automation and system-level debug methodologies
  • Hands-on experience with hardware lab equipment (e.g., Dediprog, oscilloscopes, lift tools)

Responsibilities

  • Deploy and configure a wide range of data center platforms to support validation and verification efforts
  • Set up hardware and install firmware, operating systems, workloads, and validation tools
  • Debug system and platform issues, performing root-cause analysis across hardware, firmware, and software layers
  • Track, document, and communicate issues clearly through ticketing systems to ensure priorities are addressed
  • Collaborate with cross-functional teams to follow and refine deployment processes and debug methodologies
  • Support continuous improvement initiatives to enhance platform reliability and operational efficiency

Benefits

  • AMD benefits at a glance.