About The Position

We are seeking an experienced Data Engineer with a passion for data and for building products that use cloud-native technologies and Gen AI to deliver the next generation of people analytics solutions. In this role, you will be instrumental in creating, maintaining, and enhancing the data infrastructure that powers our data products. You will also build backend infrastructure, including agentic AI systems, that enables automated, intelligent decision-making across all aspects of HR. You will build products that help AWS make data-driven decisions about our most valuable asset: our people. You will be building the foundation for autonomous, intelligent HR systems that scale across one of the world's largest and most innovative organizations.

The ideal candidate will work closely with business leaders and cross-partner teams, including the BI and Science teams, to understand their data requirements and develop solutions that serve their needs. You will build new end-to-end data engineering solutions to the highest standards, work with multiple stakeholders across different HR teams, and build products that solve business needs. A successful candidate will have a passion for innovation, an interest in next-generation technology, and excitement about working in a high-impact domain.

The AWS People Analytics (PXT) organization provides the people-data infrastructure and insights that enable AWS to become the World's Best Employer. Our Data Engineering team is at the forefront of leveraging next-generation technologies, including Gen AI, to transform how AWS business and PXT leaders access data and generate actionable insights to hire, develop, and retain the most skilled and diverse group of builders on earth.

Requirements

  • 5+ years of data engineering experience
  • Experience in at least one modern scripting or programming language, such as Python, Java, Scala, or Node.js
  • Experience with data modeling, warehousing and building ETL pipelines

Nice To Haves

  • Experience with non-relational databases / data stores (object storage, document or key-value stores, graph databases, column-family databases)
  • Experience with AWS technologies like Redshift, S3, AWS Glue, EMR, Kinesis, Firehose, Lambda, and IAM roles and permissions
  • Knowledge of distributed systems as they pertain to data storage and computing

Responsibilities

  • Develop automated and scalable data ingestion and consumption frameworks to build and enhance curated data models that are primarily used in business-critical reporting.
  • Collaborate in a cross-team environment to drive projects and business initiatives and deliver results.
  • Design and build robust data pipelines that create knowledge bases feeding our Gen AI products.
  • Architect data models organized by subject matter domains to enable efficient reporting capabilities.
  • Build and deploy both batch and real-time data pipelines, using AWS as the cloud platform that runs the ETL jobs.
  • Develop data pipelines to fetch internal and external datasets from different types of data sources and curate these datasets alongside the existing data models.
  • Engineer scalable systems that enable AI agents to autonomously access, process, and act on people data.
  • Integrate diverse data sources spanning employee attributes, workforce planning, recruiting, engagement, retention, development, benefits, and compensation.
  • Lead information architecture initiatives and establish data governance frameworks for the products being developed.
  • Build products that enable AWS business and PXT leaders to self-serve insights through AI-powered interfaces.

Benefits

  • health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage)
  • 401(k) matching
  • paid time off
  • parental leave