The AWS Neuron Compiler team is seeking skilled compiler engineers to develop a state-of-the-art deep learning compiler stack. This stack optimizes models from domains such as large language models and vision, originating from frameworks such as PyTorch, TensorFlow, and JAX. The role involves working closely with custom-built machine learning accelerators, including Inferentia and Trainium, which are at the forefront of AWS innovation and power advanced ML workloads such as generative AI.

As an ML Compiler Engineer, you will design, develop, and optimize compiler features. You will tackle crucial challenges alongside a talented engineering team and contribute to leading-edge design and research in compiler technology and deep-learning systems software. You will also collaborate closely with the Runtime, Frameworks, and Hardware teams to ensure system-wide performance optimization.

Within the Backend team, you will play a significant role in designing and developing various aspects of the system, including instruction scheduling, memory allocation, data transfer optimization, graph partitioning, parallel programming, code generation, Instruction Set Architectures, new hardware bring-up, and hardware-software co-design.
Job Type
Full-time
Career Level
Mid Level