Amazon-posted 10 days ago
Full-time • Manager
Cupertino, CA
5,001-10,000 employees

Amazon Web Services (AWS) Hardware Engineering team creates compute and accelerator server designs for Amazon’s web services. Our engineers work with leading edge technologies, solve challenging problems, influence the industry’s roadmaps, and develop new and unique solutions that are ahead of the pack. We work in an environment that fosters innovation and creativity. We encourage and invest in new directions and new ideas that will serve our customers better. It is because of the team and our constant focus on customer that we are able to develop creative and new designs that set the standards on performance, quality, cost, and operational excellence. You’ll join a diverse team of hardware engineers, software engineers, system engineers, technical program managers, and other vital roles. You’ll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. AWS Hardware Engineering is looking for a customer-obsessed, team-driven technology leader to take our engineering of server hardware and software to the next level. As a System Development Manager, you will work on Amazon’s hardest problems, driving the team that owns the designs and operations for compute and accelerator-based products. You will manage a mix of software and system engineers that are pushing the technology envelope with novel inventions to design and validate products at scale. You will understand the current infrastructure and come up with plans to improve the tools and processes for designing, validating and monitoring fleet performance of the products. Interacting with internal interdisciplinary teams, you will oversee the platform development efforts and the test content and work with the component partner teams to drive the requirements for the platform. You will oversee the system integration, development and testing for the tools that validate sub systems(CPU/memory/PCIe/power etc). You will standardize testing and define telemetry and alarming for the platforms the team designs. Maintaining healthy vendor relationships is critical to stay ahead of the curve, including proactive communication and rapidly resolving issues. This is a fast-paced, intellectually challenging position.

  • Driving the team that owns the designs and operations for compute and accelerator-based products
  • Managing a mix of software and system engineers that are pushing the technology envelope with novel inventions to design and validate products at scale
  • Understanding the current infrastructure and come up with plans to improve the tools and processes for designing, validating and monitoring fleet performance of the products
  • Interacting with internal interdisciplinary teams, you will oversee the platform development efforts and the test content and work with the component partner teams to drive the requirements for the platform
  • Overseeing the system integration, development and testing for the tools that validate sub systems(CPU/memory/PCIe/power etc)
  • Standardizing testing and define telemetry and alarming for the platforms the team designs
  • Maintaining healthy vendor relationships is critical to stay ahead of the curve, including proactive communication and rapidly resolving issues
  • 7+ years of systems engineering and operations leadership for an Internet service or leading edge IT organization experience
  • 5+ years of managing system or software development teams experience
  • 7+ years of relevant hands-on systems engineering and administrative work in networking, storage systems, operating systems experience
  • Bachelor's degree in Computer Science, Engineering, Mathematics, or a related field
  • Experience in systems engineering and operations leadership for an Internet service or leading-edge IT organization
  • Experience in managing system or software development teams
  • Experience (hands-on) in systems engineering and administrative work in networking, storage systems, and operating systems
  • Experience in agile software development methodology
  • Knowledge of professional software engineering & best practices for full software development life cycle, including coding standards, software architectures, code reviews, source control management, continuous deployments, testing, and operational excellence
  • Knowledge of systems engineering fundamentals (networking, storage, operating systems)
  • Experience with Agile engineering practices (Kanban, continuous delivery, etc.)
  • Experience with AWS platforms, services, and design patterns
  • equity
  • sign-on payments
  • medical
  • financial
  • other benefits
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service