AI Data Center Solution Architect – GPU Platforms

Advanced Micro Devices, IncAustin, TX
Hybrid

About The Position

This role blends deep GPU cluster architecture expertise with customer-facing technical leadership. It requires a systems-level thinker who can translate customer requirements into complete, buildable data center designs—spanning compute, networking, storage, power, cooling, and physical layout. You are comfortable operating as a trusted technical advisor to customers, owning the end-to-end technical narrative from early solutioning through Basis of Design development, while aligning AMD internal teams, partners, and OEMs around a coherent architecture. Success in this role requires strong judgment, cross-domain fluency, and the ability to turn complex constraints into clear, defensible designs.

Requirements

  • Experience designing large-scale GPU or HPC clusters for AI workloads.
  • Understanding of data center architecture including: High-speed Ethernet networking, Distributed and parallel storage systems, Power distribution, redundancy, and capacity planning, Air and liquid cooling solutions, Physical layout and rack-level design.
  • Experience producing or reviewing Basis of Design, reference architectures, or similar engineering documentation.
  • Customer-facing technical experience with the ability to clearly communicate complex tradeoffs.
  • Strong systems mindset with the ability to connect compute, network, storage, and facilities constraints.
  • Excellent written and verbal communication skills.
  • High ownership, attention to detail, and ability to operate independently across ambiguous problem spaces.

Responsibilities

  • Lead customer engagements to define AI GPU cluster architecture and develop complete Basis of Design (BoD) documents.
  • Architect end-to-end data center solutions including GPU platforms, high-speed networking, storage, power distribution, cooling, and floor layout.
  • Translate AI workload requirements (training, inference, mixed workloads) into scalable, reliable infrastructure designs optimized for AMD platforms.
  • Partner with customers, OEMs/ODMs, and AMD internal teams to align on technical decisions and tradeoffs.
  • Perform architecture reviews, gap analysis, and risk assessments to ensure designs are deployable and operationally sound.
  • Provide guidance on best practices for power redundancy, cooling strategies, network topology, and cluster scalability.
  • Support pre-sales, solution reviews, and executive-level technical discussions as needed.

Benefits

  • AMD benefits at a glance.
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service