Amazon Web Services is seeking an extraordinary Software Development Engineer with a Data Engineering background to join the AWS Data Platform team. Our mission is to democratize access to trusted AWS business data — cataloging, governing, enriching, and brokering data through its lifecycle. We manage the core enterprise data infrastructure and curate foundational datasets from across AWS services, transforming raw data into actionable information that provides visibility into the state of the AWS business. We support the full data lifecycle — ingestion, transformation, cataloging, governance, and consumption — at massive scale. We process trillions of events per month using stream processing (Kinesis), billions of line items via distributed compute (EMR and Spark), and store petabytes of data in open table formats (Apache Iceberg) on S3 Tables and Redshift. Our purpose-built tooling handles data ingestion, cataloging and discovery, governance, and self-service query access for internal service owners, analysts, scientists, and AI agents. We are increasingly leveraging generative AI and semantic layer technologies to make data more discoverable and queryable — enabling natural-language access to datasets and AI-powered recommendations that surface actionable intelligence for our data consumers. Because we sit at the nexus of all AWS services, we work closely across teams to ensure a great consumer experience. You will have the ability to craft and build AWS's data platform and supporting systems for years to come — working at a scale where engineering decisions have meaningful, far-reaching impact.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level