About the Role Scale’s mission is to develop reliable AI systems for the world's most important decisions. Within the Enterprise BU, we build production-grade GenAI applications for the world’s largest companies. For these organizations, the stakes are high: if an application isn’t useful, accurate, and safe, it cannot go into production. As a Strategic Projects Lead (SPL) for Enterprise Evaluations, you will oversee the evaluations that determine if an application is ready for the real world. You will define "what good looks like" for complex GenAI apps, curate the data needed to measure performance, and serve as one of the final gatekeepers for production readiness. This is a high-impact role for a technically curious operator who is equally comfortable debating a complex evaluation rubric with an engineer and communicating strategy to Fortune 500 customers. You must be obsessed with the gold standard for AI performance, from the high-level approach to the granular details of data quality.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Education Level
No Education Listed