About the Team The Foundations TPM Team is responsible for providing the training data to advance our models’ intelligence and expand their capabilities. We build and manage some of the largest datasets in the world — spanning trillions of tokens across modalities and languages. Our work directly shapes what our models learn and teaches them new skills that hundreds of millions of people use. About the Role As a Foundations TPM, you’ll work directly with researchers to create the datasets we need to achieve AGI. You’ll work with researchers to understand what data we need and drive our efforts to acquire it. You’ll also lead the development of our data platform to power both human researchers and AI agents to search, filter, and curate massive datasets for training. We’re looking for people who want to push the frontier of AI through better data and build the next generation of tooling for working with internet-scale pipelines and training datasets. This role is based in San Francisco, CA. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Education Level
No Education Listed