About The Position

Amazon People Insights & Experience (APIX) is transforming Amazon's fragmented people-data ecosystem into a centralized, AI-ready platform through the PXT Data Strategy. This multi-year program consolidates over 20 data lakes with 13,000+ data sources into a unified Central Lakehouse serving 440+ data teams. As a Senior Data Engineer, you will be instrumental in building the data infrastructure that enables self-service analytics, AI-powered insights, and data governance at a massive scale. This is a hands-on technical leadership position where you will own team-level data architecture for flagship initiatives including the Central Lakehouse, Amazon Cortex, and Clarity Metrics Marketplace (CMM). You will architect solutions that serve over 16,000 HR professionals, operations leaders, and people managers across Amazon, directly impacting how the company makes data-driven workforce decisions. We are looking for a top data engineer with deep expertise in distributed systems, data lake architectures, and a proven track record of delivering large-scale data solutions. You should excel at technical leadership, strategic thinking, and have a genuine passion for building data infrastructure that scales to support hundreds of teams building metrics in parallel while maintaining Amazon's highest privacy and security standards.

Requirements

  • 5+ years of data engineering experience
  • Experience with data modeling, warehousing and building ETL pipelines
  • Experience with SQL
  • Experience in at least one modern scripting or programming language, such as Python, Java, Scala, or NodeJS
  • Experience mentoring team members on best practices

Nice To Haves

  • Experience with big data technologies such as: Hadoop, Hive, Spark, EMR
  • Experience operating large data warehouses

Responsibilities

  • Take ownership of team data architecture with system-wide perspective, anticipating data access patterns and proactively removing bottlenecks across the Central Lakehouse, Cortex, and CMM platforms.
  • Design and deliver exemplary, large-scale data solutions that are secure, maintainable, scalable, and extensible—enabling others to easily contribute and build upon your work.
  • Lead architectural improvements that simplify complex data systems, addressing deficiencies where your team's architecture bottlenecks other teams across PXT's 20 data lakes.
  • Make appropriate architectural trade-offs (build vs. buy, tiered storage strategies, data abstraction patterns) balancing short-term technology needs with long-term business requirements for Amazon's people data ecosystem.
  • Work efficiently with limited guidance in ambiguous problem areas—where business problems are defined but technical strategies for Golden Dataset onboarding, metadata enrichment, and AI contextualization are not.
  • Lead identification and resolution of complex data engineering challenges including data duplication across 264 redundant warehouses, inconsistent metric definitions, and governance gaps across federated data lakes.
  • Influence team technical and business strategy for PXT Data Strategy workstreams, bringing perspective and context for current and future technology choices in AWS-first data platform adoption.
  • Build consensus when confronted with discordant views on data architecture approaches, demonstrating judgment on when to leverage existing solutions versus building new capabilities.
  • Design and implement scalable data pipelines, ETL processes, and data abstraction layers supporting the Central Lakehouse (1,754+ Golden Datasets), Cortex Data Plane APIs, and self-service CMM capabilities.
  • Architect solutions handling high volumes of people data across 17,000+ applications, optimizing for data quality, availability, latency, security, performance, and integrity.
  • Reduce manual data preparation effort by 60-80% through intelligent data vending, contextualized metadata, and automated dataset onboarding workflows.
  • Deliver data infrastructure supporting AI-powered insights (Clarity Assist, Quick Suite integration) with >90% query accuracy and <7 day metric creation timelines.
  • Set and enforce standards for data discovery, naming conventions, operational excellence, data security, and code quality across PXT data engineering teams.
  • Lead implementation of systematic governance through integration with FPDS primitives (DISAPERE, Maple, UBX), enabling policy-driven data classification, automated depersonalization, and cell-level access control.
  • Collaborate with AWS BDT, Security, and FPDS teams to influence roadmaps for SageMaker Unified Studio (SMUS), Andes External Tables, and Quick Suite integration—addressing 95+ identified feature gaps.
  • Ensure all data solutions comply with Amazon's privacy standards, GDPR/DSAR requirements, and Red certification processes for sensitive people data.
  • Actively mentor and coach data engineers and analysts across the organization, improving technical knowledge of distributed systems, data lake architectures, and AWS data services.
  • Provide technical assessments and guidance for DE II and DE III promotion candidates, helping team members grow their careers.
  • Lead design reviews for your team's data architecture and actively participate in design reviews of related software and data systems across PXT.
  • Demonstrate technical influence over 1-2 teams through collaborative development efforts and increasing productivity through data engineering best practices.
  • Master the constantly evolving AWS data toolkit including Andes, Athena, Glue, Redshift, SageMaker Unified Studio, and Quick Suite—adopting AWS-first approaches while retiring bespoke solutions.
  • Evaluate and integrate emerging technologies for data lake management, GenAI contextualization (Model Context Protocols, vector embeddings), and serverless data engineering patterns.
  • Pioneer privacy-first architecture patterns and AI-ready data infrastructure that positions PXT as AWS QuickSight's #1 customer and establishes foundations for external AWS product offerings.

Benefits

  • sign-on payments
  • restricted stock units (RSUs)
  • health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage)
  • 401(k) matching
  • paid time off
  • parental leave
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service