Mindrift connects specialists with project-based AI opportunities at leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is per project and is not permanent employment. This opportunity involves building a dataset to evaluate AI coding agents by creating challenging tasks and evaluation criteria within realistic simulated environments. You will build virtual companies, assemble and calibrate tasks, design tasks in isolated environments, write tests, iterate with AI agents, review code, and analyze agent performance. The role is not data labeling, prompt engineering, or writing code from scratch; it centers on guiding and evaluating AI-generated code.
Job Type: Part-time
Career Level: Mid Level