This role involves training large language models (LLMs) to generate production-grade code across a range of programming languages. Key tasks include comparing and ranking code snippets, explaining their merits, and repairing or refactoring AI-generated code for correctness, efficiency, and style. The position also requires feeding feedback (ratings, edits, test results) into the Reinforcement Learning from Human Feedback (RLHF) pipeline to keep it running smoothly. The ultimate goal is to teach the model to propose, critique, and improve code the way an expert engineer would. This is an independent contractor role, open only to applicants in accepted locations, and it is not compatible with visa statuses that require W-2 employment or employer sponsorship.
Job Type: Part-time
Career Level: Mid Level
Education Level: No Education Listed
Number of Employees: 11-50 employees