Research Engineer, Data ($180K – $250K + Equity) at Fast-growing generative AI startup

Jack & Jill/External ATSSan Francisco, CA
23h$180,000 - $250,000

About The Position

You will lead the quality and coverage of data powering next-generation foundation models. As the in-house expert on global datasets, you'll ensure exceptional performance across dozens of languages. You will bridge the gap between research and production by building scalable systems to curate, evaluate, and steer massive multilingual data collections. Why this role is remarkable: Work at the frontier of model architecture innovation alongside founding experts from world-class AI labs. Join a well-funded team backed by top-tier VCs and industry-leading AI advisors during a high-growth phase. Directly influence the intelligence and inclusivity of global-scale models used for audio, video, and text processing.

Requirements

  • Proven experience building or working with large-scale multilingual datasets for generative models like speech or text.
  • Strong applied machine learning background with a specific focus on data-centric approaches and scalable system building.
  • Demonstrated ability to guide human annotation processes and evaluation metrics across multiple languages and cultures.

Responsibilities

  • Design and build large-scale multilingual datasets and run controlled experiments to measure their impact on model behavior.
  • Develop automated quality control systems and speech model evaluations using both manual annotation and automated metrics.
  • Implement advanced steering techniques to improve model intelligence through data and mitigate bias in generative outputs.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service