This role involves training large language models (LLMs) to generate production-grade code across a range of programming languages. Key tasks include comparing and ranking code snippets, explaining in detail why the preferred approach is better, and refactoring AI-generated code for correctness, efficiency, and adherence to style guidelines. You will also integrate feedback (ratings, edits, and test results) into the Reinforcement Learning from Human Feedback (RLHF) pipeline to continuously improve the model's code generation. The ultimate goal is for the model to propose, critique, and refine code the way expert human engineers do.
Job Type: Part-time
Career Level: Mid Level
Education Level: Not listed
Number of Employees: 11-50