We’re hiring a Senior Data Engineer to own data at truly massive scale. You’ll design and run pipelines that clean, enrich, and serve data spanning hundreds of attributes across 80M+ companies and 800M+ people. The role blends classic data engineering with data operations, vendor/BPO orchestration, and data partnerships. Core stack : Python, Dagster, DuckDB Pipelines at scale : Building resilient ELT/ETL with strong contracts, idempotency, and lineage. Data operations : Set quality bars, manage BPO workflows, and run SLAs with external data partners. Serving & access : Position data for production use from serving infrastructure, documentation, and SLAs for internal consumers. Cost & performance : You tune storage/compute and keep a sharp eye on unit economics. Opinionated: Deep level of understanding of the technological landscape, making both high level system and granular code design decisions based on understanding rather than preference - diving deep on unknown patterns in order to build the best product.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior
Education Level
No Education Listed