This role is for a 1099 independent contractor focused on training large language models (LLMs) to generate production-grade code across various programming languages. The work involves comparing and ranking code snippets, refactoring AI-generated code for correctness and efficiency, and integrating feedback into the Reinforcement Learning from Human Feedback (RLHF) pipeline. The ultimate goal is to enhance the model's ability to propose, critique, and improve code, mirroring the capabilities of an expert engineer. This is a fully remote position open to contractors in accepted locations.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Part-time
Career Level
Mid Level
Education Level
No Education Listed
Number of Employees
11-50 employees