Data is the foundation of AI performance, and we believe model quality starts with data quality. For speech and audio models in particular, the bar for signal fidelity, consistency, and quality control is exceptionally high. We’re seeking a Machine Learning Researcher focused on audio data quality, ML data evaluation, and quality control to lead the evaluation and optimization of large-scale speech datasets used to train audio, speech, and multimodal models. This role will be responsible not only for applying existing audio quality metrics, but also for researching how audio data quality should be evaluated for machine learning systems and developing new methods, benchmarks, and evaluation frameworks that better predict downstream model performance. You will help define what “high-quality audio data” means in the context of modern ML training. That includes studying how different forms of acoustic degradation, dataset inconsistency, recording conditions, speaker variation, labeling quality, segmentation quality, and signal artifacts affect model behavior across ASR, TTS, speaker modeling, representation learning, and multimodal systems. A core part of this role will be original research and method development: designing new approaches for measuring audio data quality, validating those approaches against downstream model outcomes, and translating research insights into practical evaluation tools, filtering rules, and quality standards used across Protege’s data platform. This is an ideal role for someone deeply obsessed with audio data quality and signal understanding, comfortable operating in both research and hands-on implementation modes, and excited to help Protege become the ubiquitous platform for high-quality AI training data.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior