Sr. Staff AI/ML Development Engineer

Advanced Micro Devices, IncBellevue, WA
6hHybrid

About The Position

At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career. The Model Performance & Optimization team within AMD’s AI Group is seeking an accomplished engineer to lead efforts in advancing AI inference and training performance on AMD GPU platforms. Our team works with cutting-edge customer models, industry-standard benchmarks, and emerging AI paradigms to analyze and optimize their performance across AMD’s latest hardware offerings. We also use insights gained from these activities to inform future hardware and software roadmap development. As part of this process, the team develops tools to improve visibility and automate analysis of model behavior. THE PERSON: The ideal candidate will have knowledge of, experience with, and passion for both AI/ML models/applications and hardware/software systems. You’ll enjoy learning and be skilled at communicating what you’ve learned with others.

Nice To Haves

  • Experience developing and debugging in Python and ML frameworks such as Pytorch and JAX
  • Experience with current state-of-the-art AI models, especially in NLP and Generative AI
  • Experience optimizing the performance of AI models on accelerator hardware, including:
  • Understanding of AI model optimization techniques such as operator fusion, quantization, and sparsity
  • Solid understanding of hardware & software systems, including distributed systems
  • Ability to analyze and address performance bottlenecks
  • Low-level GPU programming and performance optimization experience a plus

Responsibilities

  • Follow the latest developments in AI/ML models and techniques through publications, blog posts, conferences, etc.
  • Develop innovative new techniques that leverage the strength of AMD platforms for improved performance and higher quality and share them with the AI community through publications and open-source contributions
  • Drive revenue growth by partnering with customers to optimize their AI workloads on AMD platforms
  • Develop, maintain, and enhance both internal and open-source tools for profiling, analyzing, and projecting performance
  • Identify strategic opportunities for AMD in emerging AI technologies
  • Communicate findings with AMD internal teams and, as appropriate, the external AI community
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service