Data Engineering Lead

DaydreamNew York, NY
80d

About The Position

As a Data Engineering Lead at Daydream, you will be a foundational member of the team, responsible for designing and building the entire data ecosystem that fuels our AI Personal Stylist. This is a unique opportunity to solve complex technical challenges while directly shaping a product that will revolutionize how people shop online.

Requirements

  • Extensive experience building and deploying data solutions on a major cloud platform (preferable Google Cloud Platform).
  • Highly proficient with distributed data processing frameworks such as Apache Spark, Flink, or Polars.
  • Exceptional Python coding skills, with a deep understanding of writing efficient, testable, and maintainable code for data applications.
  • Expert-level SQL skills and deep experience with modern cloud data warehouses like BigQuery, Snowflake, or Redshift.
  • Hands-on experience with workflow orchestration tools like Airflow, Argo or Kubeflow.
  • Pragmatic and proactive builder who thrives in a fast-paced, autonomous startup environment, capable of driving projects from concept to production.
  • Empathetic and collaborative teammate, skilled at communicating complex technical ideas and passionate about building the reliable infrastructure that empowers your colleagues.
  • Natural leader who enjoys mentoring and developing teammates and aligning work to provide growth opportunities while ensuring priorities are aligned with broader company goals.

Responsibilities

  • Design, build, and optimize scalable, parallel data processing pipelines on Google Cloud to handle massive volumes of offline data.
  • Implement and manage large-scale LLM batch inference jobs, processing millions of data points to enrich our product catalog with sophisticated, AI-generated attributes.
  • Architect and own the data infrastructure for our Fashion Knowledge Graph, leveraging BigQuery and parallel data processing frameworks.
  • Develop and maintain robust feature generation pipelines to craft high-quality signals for both the training and inference of our machine learning models.
  • Orchestrate complex workflows of data processing jobs, implementing robust monitoring, alerting, and data quality validation systems to ensure reliability and trust in our data.
  • Collaborate closely with data science and machine learning teams to understand data requirements and deliver production-grade data solutions.
  • Champion engineering best practices, including writing clean, maintainable Python and SQL, and drive a culture of high-quality data and operational excellence.

Benefits

  • Competitive salary, equity and benefits (medical, dental, vision, 401k, etc.)
  • Flexible vacation and remote working
  • The opportunity to be part of a groundbreaking, AI-focused company
  • Collaborative work environment with a team of talented, fun-loving individuals.
  • Opportunity to learn and grow in your career while shaping the future of fashion, shopping and technology

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Industry

Clothing, Clothing Accessories, Shoe, and Jewelry Retailers

Number of Employees

11-50 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service