About The Position

Do you want to build the backbone of the Generative AI cloud at AWS? Do you want to build the future of the cloud for AI training and inference? Do you want to do industry-leading work delivering continuous price-performance improvements in the cloud for AI model training for multi-billion-variable LLMs? Come join us in designing, delivering, and operating AWS cloud offerings that enable high performance and scalability in AI/ML and HPC workloads.

AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. In other words, we’re the people who keep the cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment that ensure our customers have continual access to the innovation they rely on. We work on the most challenging problems, with thousands of variables impacting the supply chain, and we’re looking for talented people who want to help. You’ll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers, and other vital roles. You’ll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you’ll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.

Requirements

  • Bachelor's degree in electrical engineering, computer engineering, or equivalent
  • Experience in developing functional specifications and functional test procedures, and in troubleshooting
  • Experience in server technologies such as thermal, mechanical, power, and signal integrity

Nice To Haves

  • Experience in compute and storage server architecture and design for large scale applications
  • 4+ years of hardware development with a focus on system/server development in compute and/or storage server architecture, design, and troubleshooting

Responsibilities

  • Own and lead the design, development, and root-cause analysis of a new segment of accelerated servers.
  • Work closely with our customers to understand their technical needs and business goals, leveraging your experience with server design and the knowledge of various teams to architect the solutions that we will deploy at scale.
  • Work with an interdisciplinary team of component, firmware, test, qualification, and integration engineers, and lead our design and manufacturing partners to bring these servers to the data center.
  • Oversee the fleet of servers you develop, monitoring their quality and how well they meet customer requirements.
  • Interface with our internal and external customers to understand project requirements and facilitate system development on top of your server design.
  • Be responsible for learning the operational challenges of our existing fleet, with the goal of improving the current customer experience and developing improved systems for future designs.
  • Work directly with vendors and ODM/JDM design teams to develop and manufacture your product at scale.

Benefits

  • Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits.