As part of the AWS Applied AI Solutions organization, we have a vision to provide business applications, leveraging Amazon’s unique experience and expertise, that are used by millions of companies worldwide to manage day-to-day operations. We will accomplish this by accelerating our customers’ businesses through delivery of intuitive and differentiated technology solutions that solve enduring business challenges. We blend vision with curiosity and Amazon’s real-world experience to build opinionated, turnkey solutions. Where customers prefer to buy over build, we become their trusted partner with solutions that are no-brainers to buy and easy to use. Amazon Connect is an AI-powered customer experience solution that enables superior outcomes at a lower cost. Since its 2017 public launch, Amazon Connect has become an AI leader, transforming how organizations of all types interact with their customers. Do you want to build and optimize the infrastructure that serves frontier Large Language Models (LLMs) at massive scale, transforming how customers interact with AI-powered services? Join a world-class team of ML engineers and scientists within AWS to develop production ML systems that power next-generation applications in cloud computing. Amazon Web Services (AWS) is the world’s leading cloud platform, supporting millions of customers globally. Our customers bring complex, high-impact problems that create unique opportunities for Machine Learning Engineers to deliver solutions with immediate, real-world impact. You will operate as a technical leader, owning the design and evolution of large-scale ML infrastructure. You will partner closely with applied scientists, software engineers, and product teams to translate frontier LLM research into highly reliable, efficient, and scalable production systems. You will work with state-of-the-art GPU and custom accelerator hardware, and leverage AWS’s unmatched scale in data and compute to push the boundaries of LLM serving and optimization. As part of the team, we expect that you will design and build highly available, cost-efficient LLM serving systems, optimize inference performance across the full stack, and develop innovative ML infrastructure solutions that enable our scientists to iterate faster and our customers to experience AI capabilities at their best.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level