Senior Data Engineer III

SHEINUnited States,
$183,360 - $205,000Onsite

About The Position

SHEIN TECHNOLOGY LLC is seeking a Sr. Data Engineer III in San Diego, CA to collaborate with global teams across data, security, infrastructure, and business functions to analyze data requirements and design scalable data engineering solutions. Design, develop, and maintain efficient and scalable data pipelines to extract, transform, and load (ETL) data across distributed systems. Apply data validation and quality assurance techniques to ensure the accuracy, consistency, and completeness of data throughout data processing workflows. Analyze and optimize data pipelines and processing jobs for performance, scalability, and reliability by identifying and addressing system-level inefficiencies. Ensure data integrity, security, privacy, and high availability through appropriate data modeling, access controls, and system architecture design. Monitor data pipelines and distributed data processing systems to identify abnormal behavior, diagnose technical issues, and implement corrective actions in production environments. Perform technical root cause analysis of data processing issues and collaborate with cross-functional teams to implement long-term, preventative solutions. Develop and maintain technical documentation for data pipeline designs, system architectures, and operational procedures, and communicate technical updates to stakeholders. Participate in a rotational on-call schedule to provide engineering-level support for critical data systems, ensuring production stability and reliability.

Requirements

  • Bachelor’s degree or a foreign equivalent in Applied Data Science, Computer Science, or a related field, plus 4 years of post-baccalaureate experience in job offered or Data Engineering related job titles.
  • 4 years of experience in building and optimizing large-scale, distributed data pipelines with Hive, Presto, Spark, or Flink.
  • 4 years of experience in data warehousing, including dimensional modeling, star/snowflake schema design, and normalization/denormalization strategies in large-scale data warehouses including Amazon Redshift.
  • 4 years of experience in writing and optimizing complex SQL queries for large datasets, creating joins, aggregations, and subqueries, in the context of querying data warehouses.
  • 4 years of experience in data storage solutions, including S3 on AWS.
  • 4 years of experience in cloud-native services including AWS EMR, AWS S3.
  • 4 years of experience using workflow orchestration tools including Airflow in a production environment to automate, schedule, monitor and tune, data pipelines.

Responsibilities

  • Collaborate with global teams across data, security, infrastructure, and business functions to analyze data requirements and design scalable data engineering solutions.
  • Design, develop, and maintain efficient and scalable data pipelines to extract, transform, and load (ETL) data across distributed systems.
  • Apply data validation and quality assurance techniques to ensure the accuracy, consistency, and completeness of data throughout data processing workflows.
  • Analyze and optimize data pipelines and processing jobs for performance, scalability, and reliability by identifying and addressing system-level inefficiencies.
  • Ensure data integrity, security, privacy, and high availability through appropriate data modeling, access controls, and system architecture design.
  • Monitor data pipelines and distributed data processing systems to identify abnormal behavior, diagnose technical issues, and implement corrective actions in production environments.
  • Perform technical root cause analysis of data processing issues and collaborate with cross-functional teams to implement long-term, preventative solutions.
  • Develop and maintain technical documentation for data pipeline designs, system architectures, and operational procedures, and communicate technical updates to stakeholders.
  • Participate in a rotational on-call schedule to provide engineering-level support for critical data systems, ensuring production stability and reliability.
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service