Cantina Labs is a social AI company focused on developing advanced real-time models for expression, personality, and realism to bring characters to life. They aim to transform how people tell stories, connect, and create by building and powering ecosystems. The company is seeking an ML Engineer specializing in Data Quality to manage the datasets crucial for their speech systems. This role involves hands-on work with audio and text data, including auditing, denoising, filtering, labeling, and developing tools and models to transform raw data into reliable training corpora for Text-to-Speech (TTS) and related tasks. The engineer will establish data quality metrics and classifiers, manage human-in-the-loop annotation programs, and implement quality gates within training and evaluation pipelines. The objective is to directly enhance model performance, robustness, and cost-efficiency by managing the data aspect of the model-data-evaluation cycle.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Education Level
No Education Listed
Number of Employees
11-50 employees