Annapurna Labs, a key part of AWS innovation, is responsible for silicon development. This role focuses on AWS Machine Learning accelerators, specifically the Inferentia and Trainium chips, which are crucial for Generative AI on AWS. The AWS Neuron Software Development Kit (SDK), including an ML compiler and runtime, integrates with popular ML frameworks like PyTorch, TensorFlow, and JAX, and is used extensively by customers. The Neuron Compiler team is developing a deep learning compiler stack to optimize LLM and Vision models for AWS accelerators, aiming for significant performance improvements. As a Compiler Engineer II, you will contribute to the development and scaling of this compiler for large ML workloads, architecting and implementing features, publishing research, and collaborating with AWS ML services teams. You will be involved in pre-silicon design and bringing new products to market.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level