Sr. SoC Virtual Platform Modeling Engineer, Annapurna Labs Machine Learning Accelerators, AWS

Amazon•Cupertino, CA

12h

About The Position

Custom SoCs (System on Chips) are the brains behind AWS’s Machine Learning servers. Our team builds C++ & SystemC functional models of these custom-designed accelerator SoCs for use by AWS internal teams to significantly left-shift development. We’re looking for a Senior SoC Virtual Platform Modeling Engineer to join the team and deliver new functional models, infrastructure, and tooling for our customers. As part of the ML accelerator modeling team, you will: - Develop and own SoC models end-to-end, including model architecture, integration with other model or infrastructure components, testing, and debug - Work closely with architecture, RTL design, design verification, emulation, and software teams to build, debug, and deploy your models - Innovate on the tooling you provide to customers, making it easier for them to use our SoC models - Drive model and modeling infrastructure performance improvements to help our models scale - Develop software which can be maintained, improved upon, documented, tested, and reused Annapurna Labs, our organization within AWS, designs and deploys some of the largest custom silicon in the world, with many subsystems that must all be modeled and tested with high quality. Our SoC model is a critical piece of software used in both our SoC development process and by our partner software teams as a virtual platform. You’ll collaborate with many internal customers who depend on your models to be effective themselves, and you'll work closely with these teams to push the boundaries of how we're using modeling to build successful products. You will thrive in this role if you: - Are an expert in modeling SoCs, ASICs, TPUs, GPUs, or CPUs - Are comfortable modeling in C++ or SystemC, and familiar with Python - Enjoy learning new technologies, building software at scale, moving fast, and working closely with colleagues as part of a small team within a large organization - Want to jump into an ML-aligned role, or get deeper into the details of ML at the hardware/system-level Although we are building machine learning chips, no machine learning background is needed for this role. This role spans modeling of the ML and management regions of our chips, and you’ll dip your toes into both. You’ll be able to ramp up on ML as part of this role, and any ML knowledge that’s required can be learned on-the-job. This role can be based in either Cupertino, CA or Austin, TX. The broader team is split between the two sites, with a slight preference for CA, due to colocation with more customer teams. We're changing an industry. We're searching for individuals who are ready for this challenge, who want to reach beyond what is possible today. Come join us and build the future of machine learning!

Requirements

5+ years of non-internship professional software development experience
5+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
Experience as a mentor, tech lead or leading an engineering team
7+ years of non-internship professional experience writing functional or performance models
Experience programming with C++ and/or SystemC
Knowledge of SoC, CPU, GPU, and/or ASIC architecture and micro-architecture
Bachelor's degree in computer science or equivalent

Nice To Haves

Experience analyzing data and applying best practices to assess performance drivers
Experience developing models that integrate with QEMU
Experience developing and calibrating performance models for custom silicon chips
Experience with PyTest and GoogleTest
Familiarity with modern C++ (11, 14, etc.)
Experience in multi-threaded programming
Experience with machine learning accelerator hardware and/or software

Responsibilities

Develop and own SoC models end-to-end, including model architecture, integration with other model or infrastructure components, testing, and debug
Work closely with architecture, RTL design, design verification, emulation, and software teams to build, debug, and deploy your models
Innovate on the tooling you provide to customers, making it easier for them to use our SoC models
Drive model and modeling infrastructure performance improvements to help our models scale
Develop software which can be maintained, improved upon, documented, tested, and reused

Benefits

health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage)
401(k) matching
paid time off
parental leave

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume