Fellow, AI Workload Optimization

Advanced Micro Devices, IncBellevue, WA

About The Position

We are looking for a visionary technical leader to join the AI Software group. As a Fellow, you will be accountable for defining and driving the end-to-end software optimization strategy to achieve industry-leading performance for our top-tier customers. You will sit at the intersection of architecture, customer engagement, and software engineering, ensuring that AMD’s software stack—from ROCm and compilers to high-level AI frameworks—is tuned to extract maximum performance for the world's most demanding AI workloads.

Requirements

  • 15+ years of software development experience with at least 5 years in a high-level technical leadership role (Fellow or equivalent).
  • Deep expertise in AI Frameworks (PyTorch, JAX, vLLM, SGLang) and the ROCm software stack.
  • Proven history of optimizing distributed inference and training at scale across multi-node/multi-GPU environments.
  • Mastery of performance profiling tools (e.g., TorchProfiler, ROCm Profiler, Nsight) and hardware-level performance modeling.
  • Strong understanding of modern model architectures (Transformer, Attention, KV Cache) and optimization techniques like quantization, speculative decoding, and FlashAttention.
  • Demonstrated ability to drive cross-functional initiatives in fast-paced, ambiguous environments.
  • PhD or Master’s degree in Computer Science, Electrical Engineering, or a related field, or equivalent experience.
  • Demonstrated research or applied experience in AI/ML, including areas such as deep learning, model training/inference optimization, large language models, or computer vision.

Responsibilities

  • Set the technical vision and roadmap for workload optimization across the AI software stack, ensuring AMD remains the platform of choice for top-tier AI customers.
  • Lead the profiling, analysis, and tuning of large-scale models (LLMs, Diffusion, Multimodal, and MoE) to ensure "out-of-the-box" performance excellence on AMD hardware.
  • Partner with top customers and hyperscalers to understand their unique workload requirements and deliver tailored architectural wins and software optimizations.
  • Collaborate across hardware architecture, compiler, and framework teams to influence future silicon features based on evolving AI workload trends.
  • Drive the development of advanced tools and frameworks for performance estimation, modeling, and automated reporting.
  • Act as a technical ambassador in industry forums and open-source communities. Mentor and inspire the next generation of AMD’s technical leaders and engineers.

Benefits

  • AMD benefits at a glance.
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service