About The Position

This role is for a Cloud Hardware Dev Engineer within AWS Hardware Engineering Services, focusing on building the backbone of Generative AI cloud at AWS, specifically for AI training and inference. The position involves delivering continuous price performance improvements for AI model training for multi-billion variable LLMs. As part of the AWS Utility Computing (UC) organization, the engineer will contribute to product innovations in foundational services like Amazon S3 and Amazon Elastic Compute Cloud (EC2), and new product innovations that differentiate AWS. The UC organization develops and manages Compute, Database, Storage, Internet of Things (IoT), Platform, and Productivity Apps services, including specialized security solutions. The role involves joining a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, and operations managers, collaborating across AWS to ensure high standards for safety and security, providing seemingly infinite capacity at the lowest possible cost for customers, and fostering an inclusive culture.

Requirements

  • Experience working with interdisciplinary teams to execute product design from concept to production.
  • Experience developing and executing test procedures for mechanical or electrical systems/components based on design intent and approved equipment submissions.
  • Knowledge of server hardware and components.
  • Bachelor's degree in electrical engineering or equivalent.
  • 2+ years of server hardware troubleshooting and repair experience.
  • 4+ years of hardware design and validation of components, subsystems and systems experience.

Nice To Haves

  • Master's degree or above in electrical engineering, computer engineering, or equivalent.
  • Experience in compute and storage server architecture and design for large scale applications.
  • AI infrastructure hardware development and debugging experience.

Responsibilities

  • Own and lead the design, development and root cause of a new segment of accelerated servers.
  • Work closely with customers to understand their technical needs and business goals.
  • Leverage experience with server design and the knowledge of various teams to architect the solutions that will be deployed at scale.
  • Work with an interdisciplinary team of component, firmware, test, qualification, and integration engineers.
  • Lead design and manufacturing partners to bring these servers to the data center.
  • Oversee the fleet of servers developed, monitoring their quality and how they are meeting the customer requirements.
  • Interface with internal and external customers to understand project requirements and facilitate system development on top of server design.
  • Learn operational challenges to existing fleet with the goal of improving the current customer experience.
  • Develop improved systems for future designs.
  • Work directly with vendors and ODM/JDM design teams to develop and manufacture products at scale.

Benefits

  • Sign-on payments
  • Restricted stock units (RSUs)
  • Health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage)
  • 401(k) matching
  • Paid time off
  • Parental leave
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service