Data Lead

CartesiaSan Francisco, CA
100d

About The Position

Data is the lifeblood of our models, and we are looking for a Data Lead to own the strategy and execution for all data at Cartesia. This is a critical leadership role, where you will be responsible for building and managing the datasets that power our cutting-edge research. You will lead a talented team of data engineers and specialists to acquire, process, and curate massive multimodal datasets. Your vision will directly shape the capabilities and quality of our foundational models.

Requirements

  • Technical expertise in large-scale data engineering.
  • Familiarity with building datasets for and evaluating generative models.
  • Leadership skills to grow and guide a high-impact data team.

Responsibilities

  • Define Cartesia’s overall multi-modal data strategy across pre-training and post-training, including human, synthetic, and web-scale data sources.
  • Lead, manage, and mentor a team.
  • Design and oversee the construction of robust, scalable data pipelines for text, audio, and video.
  • Establish and enforce rigorous standards for data quality across the organization.
  • Deeply understand how data affects model capability and proactively identify and source novel datasets.
  • Manage relationships and budgets with external data vendors and partners.

Benefits

  • Lunch, dinner and snacks at the office.
  • Fully covered medical, dental, and vision insurance for employees.
  • 401(k).
  • Relocation and immigration support.
  • Your own personal Yoshi.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service