Senior Cloud Hardware Development Engineer, Cloud AI/ML/storage server teams

Amazon | Cupertino, CA
$183,000 - $247,600 | Onsite

About The Position

AWS Infrastructure Services is responsible for the design, planning, delivery, and operation of all AWS global infrastructure, ensuring customers have continuous access to the innovation they rely on. The AWS Hardware Engineering team designs compute and storage servers for Amazon's web services, working with leading-edge technologies to solve challenging problems and influence industry roadmaps. This role involves owning and leading the design and development of server products, collaborating with customers to understand their technical needs, and architecting solutions for large-scale deployment. You will work with an interdisciplinary team and manufacturing partners to bring servers to the data center, and oversee their quality and performance post-launch. This is a fast-paced, intellectually challenging position requiring high standards and a drive for continuous improvement.

Requirements

  • Experience in developing functional specifications, design verification plans and functional test procedures
  • Experience in server technologies such as thermal, mechanical, power, and signal integrity
  • Bachelor's degree or above in electrical engineering, computer engineering, or equivalent
  • 5+ years of design/innovation, research and development, manufacturing, process, industrial engineering, or related experience
  • Experience leading process improvement, systems development, and project management
  • Strong English-language communication skills, both written and verbal
  • 7+ years of equivalent experience
  • In-depth expertise in one or more server technologies such as thermal/mechanical design, high-speed bus design and signal integrity, failure analysis, server components (e.g. CPU, GPU, SSDs, memory), BIOS, BMC, and networking

Nice To Haves

  • Experience leading engineering teams as a mentor or tech lead, or experience with general troubleshooting/debugging of hardware
  • Experience working with technical and product stakeholders to define requirements, prioritize features, and influence product roadmaps
  • 3+ years of new hardware product development experience, e.g. server, storage, networking, or large-scale distributed systems experience
  • In-depth expertise in one or more server technologies: thermal/mechanical design, high-speed bus design and signal integrity, failure analysis, server components (CPU, GPU, SSDs, memory), BIOS, BMC, and networking
  • Experience developing and executing test procedures for mechanical or electrical systems/components
  • Experience working with ODMs/manufacturers throughout the product development and manufacturing lifecycle
  • Experience building predictive failure detection or proactive remediation systems at fleet scale
  • Experience with storage/compute/GPU/accelerator platforms including integration, diagnostics, or performance validation
  • Familiarity with PCIe topology, NVLink, NVMe, and accelerator interconnects
  • Experience with large-scale datacenter or cloud environments

Responsibilities

  • Lead technical solutions for complex storage and/or accelerator server and rack system architectural challenges
  • Own end-to-end system reliability, proactively identifying and resolving deficiencies before customer impact
  • Design and implement solutions to address system-level issues at large scale
  • Decompose complex server system problems (testability, reliability, diagnostics) into deliverable tasks and features
  • Apply expertise across hardware, software, system design, x86 architecture, processes, and operations
  • Collaborate with hardware, software, manufacturing, supply chain and product management teams
  • Develop and implement diagnostic tools and monitoring solutions for production systems
  • Debug complex system failures in time-sensitive settings

Benefits

  • health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage)
  • 401(k) matching
  • paid time off
  • parental leave
  • sign-on payments
  • restricted stock units (RSUs)