Data Center Engineer

Advanced Micro Devices, Inc.
Austin, TX
Onsite

About The Position

At AMD, our mission is to build great products that accelerate next-generation computing experiences, from AI and data centers to PCs, gaming, and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity, and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges, striving for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.

In this critical and highly technical role, you will be responsible for executing AMD's datacenter graphics hardware/software subsystem projects for AMD OEM partners and enterprise commercial end-customers. This position provides a unique opportunity to leverage your expertise in graphics, compute, datacenter technologies, virtualization, AI/Machine Learning, and program management to collaborate with customers utilizing AMD Instinct™ Accelerators. You must be a team player with a strong commitment to meeting deadlines and the ability to thrive in a fast-paced, multi-tasking environment.

Requirements

  • Exceptional datacenter deployment and troubleshooting skills in AI GPU hardware, software, and networking.
  • Highly analytical, detail-oriented, and self-motivated, with a positive, results-driven attitude.
  • Team player with a strong commitment to meeting deadlines and the ability to thrive in a fast-paced, multi-tasking environment.

Nice To Haves

  • Experience in datacenter customer support roles.
  • Experience in large-scale cluster deployment within hyperscale datacenters.
  • Experience in server architecture and functionality, including remote management, network topologies, and graphics software/hardware subsystems.
  • Experience in Linux installation, setup, usage, and debugging.
  • Experience with virtual environments (e.g., VMware, Citrix, KVM, Microsoft) and virtual machine setup/management.
  • Experience with datacenter GPU software stacks such as AMD ROCm™ or NVIDIA CUDA.
  • Experience validating multinode AI clusters using AMD tools (e.g., AGFHC, RCCL RDMA) or equivalents.
  • Experience with AI/Machine Learning workloads, frameworks, and models.
  • Strong debugging, problem-solving, and analytical skills.
  • Excellent verbal and written communication skills for conveying technical information.
  • Self-starter with attention to detail, organizational skills, and the ability to multitask in a fast-paced environment.
  • Technical certifications in relevant software systems are highly desirable.

Responsibilities

  • Perform node and cluster-level software installation and validation for GPU/compute AI and Machine Learning projects.
  • Resolve technical issues for customers utilizing AMD Instinct™ products.
  • Provide technical guidance and support to customers for server graphics and compute projects related to AI and Machine Learning workloads.
  • Build datacenter GPU Docker images and containers for customer testing and deployment.
  • Qualify and assess new software functionality to ensure compatibility with customer requirements.
  • Assist development teams in identifying and resolving hardware/software technical issues throughout the product lifecycle, from initial hardware bring-up to end-of-life.
  • Follow procedures to communicate, report, and escalate incidents to AMD management.
  • Collaborate with program managers to maintain project schedules, track action items, ensure deliverables are met, and provide project status updates to customers and AMD management.
  • Develop a strong understanding of the client’s business to ensure impactful and effective task completion.

Benefits

  • AMD benefits at a glance.