The EC2 Infrastructure Services organization is responsible for ensuring the constant availability of EC2 instances, playing a crucial role in EC2's elasticity. With AI infrastructure becoming increasingly important in EC2, we are developing systems, services, and automation to manage that infrastructure at scale. The Software Development Engineer will design, build, and maintain cloud-based provisioning and recovery systems for AWS Trainium-based AI UltraServers. This role demands expertise in AWS services and system architecture, as well as collaboration with Capacity Management, Hardware Engineering, and Datacenter Operations to manage AI/ML infrastructure.
Job Type: Full-time
Career Level: Mid Level