Software Development Engineer II, Intelligent Cloud Hosting (ICON)

AmazonSeattle, WA
$143,700 - $194,400Onsite

About The Position

Amazon's Intelligent Cloud Hosting (ICON) team is looking for a Software Development Engineer (SDE) to join our team. ICON is responsible for the reliability and operational excellence of Amazon's cloud hosting infrastructure, supporting all of Amazon's global marketplaces, partner portals, and consumer experiences including Kindle, Alexa, Amazon Video, and the Mobile Application. The team builds intelligent systems that proactively detect, diagnose, and resolve incidents across hundreds of thousands of services powering one of the world's largest distributed architectures. The challenges SDEs solve on this team are high-impact and mission-critical. The team is building AI-powered incident response systems that automatically investigate production issues, identify root causes from metrics, logs, and deployment events, and recommend mitigations to on-call engineers. These systems operate at massive scale, processing thousands of signals per investigation and reducing mean-time-to-resolution for critical production incidents. As an SDE II on the team, you will: Design and build production generative AI workflow that automate incident investigation workflows, from alert ingestion through root-cause analysis to mitigation recommendations. Work on tier-1, multi-tenant, high-performance systems built on AWS services (Step Functions, Bedrock, DynamoDB, Athena) with technical challenges unique to this kind of scale and throughput. Build developer productivity and operational tooling including orchestration, predictive analytics, automated diagnosis, and self-healing systems. The team is looking for engineers who are passionate about applying generative AI and machine learning to operational problems, thrive in ambiguous environments, and want to build systems that keep Amazon's infrastructure running for millions of customers worldwide.

Requirements

  • 3+ years of non-internship professional software development experience
  • 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
  • Experience programming with at least one software programming language

Nice To Haves

  • 3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
  • Bachelor's degree in computer science or equivalent

Responsibilities

  • Design and build distributed systems and automation in a large-scale cloud environment that supports millions of customers globally
  • Develop scalable services and tools on AWS that process high volumes of operational data to drive better decision-making
  • Solve broadly defined problems from design to delivery, balancing speed with long-term technical quality
  • Collaborate with engineers, scientists, and product managers to scope projects and ensure deliverables meet a high quality bar
  • Evaluate and apply emerging technologies, including generative AI and machine learning, to solve real-world operational challenges
  • Work in an agile environment delivering high-quality software with a strong focus on operational excellence, security, and availability
  • Design and build production generative AI workflow that automate incident investigation workflows, from alert ingestion through root-cause analysis to mitigation recommendations
  • Work on tier-1, multi-tenant, high-performance systems built on AWS services (Step Functions, Bedrock, DynamoDB, Athena) with technical challenges unique to this kind of scale and throughput
  • Build developer productivity and operational tooling including orchestration, predictive analytics, automated diagnosis, and self-healing systems

Benefits

  • health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage)
  • 401(k) matching
  • paid time off
  • parental leave
  • sign-on payments
  • restricted stock units (RSUs)
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service