About The Position

Andela is transitioning from a world-class talent marketplace into a high-scale, AI-integrated Talent Cloud. In this Senior Data Engineer role, you will be forward deployed with an enterprise-scale organization in the automotive industry undergoing a significant transformation in how it leverages data and analytics to drive commercial and operational decisions. They are investing in modern data infrastructure, advanced analytics, and AI/ML capabilities to improve performance across key business areas. The environment emphasizes strategic thinking, cross-functional collaboration, and the ability to translate complex data into actionable insights. This role requires someone who enjoys working with customers, thinks about data as a product, ensures data is reliable, accessible, and structured to enable self-service analytics across the organization.

Requirements

  • 8+ years in data engineering on cloud platforms
  • Snowflake — data modelling, query optimisation, staging environments
  • Python — pandas, PySpark, data pipeline scripting
  • Experience building feature stores for ML consumption
  • Strong understanding of schema design and dimensional modelling

Nice To Haves

  • Experience in automotive, retail, or dealer network data
  • Familiarity with CRM data structures (for Aftermarket hire)
  • Azure — Data Factory, Blob Storage, or Synapse
  • Apache Airflow or similar orchestration tooling
  • Azure DevOps for pipeline CI/CD

Responsibilities

  • Build and maintain Snowflake data pipelines for Dealer 360 and Aftermarket workstreams respectively
  • Design and implement the dealer and aftermarket feature stores (Layer 1–2)
  • Build ingestion pipelines for all external data sources (JD Power, PIN, S&P, Vehicle Registration, competitive scraping + 2 TBC)
  • Write and maintain dbt models for data transformation, cleaning, and normalisation
  • Enforce schema validation, data quality checks, and freshness SLAs across all feeds
  • Collaborate with the Data Architect to implement the unified data model
  • Produce documented data lineage for every pipeline before any model is trained against it
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service