About The Position

The EC2 Infrastructure Services organization is responsible for ensuring the constant availability of EC2 instances, playing a crucial role in EC2's elasticity. With AI infrastructure becoming increasingly important in EC2, we are developing systems, services, and automation to manage this infrastructure at scale. The Software Development Engineer will be responsible for designing, building, and maintaining cloud-based provisioning and recovery systems for AWS Trainium-based AI UltraServers. This role demands expertise in AWS services and system architecture, as well as collaboration with Capacity Management, Hardware Engineering, and Datacenter Operations to manage AI/ML infrastructure.

Requirements

  • 3+ years of non-internship professional software development experience.
  • 2+ years of non-internship experience in the design or architecture (design patterns, reliability, and scaling) of new and existing systems.
  • 1+ years of experience as a software development engineer or in a related occupation.
  • 1+ years of experience designing and developing large-scale, multi-tiered, multi-threaded, embedded, or distributed software applications, tools, systems, and services using C#, C++, Java, or Perl.
  • 1+ years of Object Oriented Design experience.
  • Bachelor's degree or foreign equivalent in Computer Science, Engineering, Mathematics, or a related field.
  • Experience programming in at least one programming language.

Nice To Haves

  • 3+ years of experience with the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations.
  • Bachelor's degree in computer science or equivalent.

Responsibilities

  • Building and maintaining scalable microservices.
  • Designing systems that efficiently solve business problems.
  • Working in environments where the technology strategy is defined but the solution design is not.
  • Building cloud-based solutions using AWS native services for scaling infrastructure frameworks.
  • Creating observable systems with appropriate metrics and alarming.
  • Collaborating with customers and stakeholders to convert business needs into technical designs.
  • Participating in code reviews and technical assessments.

Benefits

  • health insurance (medical, dental, vision, prescription, basic life and AD&D insurance with an option for supplemental life plans, EAP, mental health support, medical advice line, flexible spending accounts, adoption and surrogacy reimbursement coverage)
  • 401(k) matching
  • paid time off
  • parental leave
  • sign-on payments
  • restricted stock units (RSUs)