NewsBreak is seeking a hands-on Machine Learning Engineer to lead the post-training of their large language models, with a primary focus on reinforcement learning (RL). This role involves owning the entire post-training stack, including continuous pre-training (CPT), supervised fine-tuning (SFT), and RL, as well as the data preparation required for these stages. The engineer will collaborate directly with product and business teams to translate real-world use cases into training objectives and rapidly implement model improvements. This is a high-ownership position for an individual with practical experience in training models.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Education Level
No Education Listed