This role involves fine-tuning OpenAI's most advanced models on Azure, including GPT-4.1, GPT-5, and beyond, utilizing techniques such as supervised fine-tuning (SFT) and reinforcement learning methods like DPO. The successful candidate will be hands-on with the techniques shaping the future of AI, playing a key role in improving the performance and reliability of customizing advanced AI models. This includes gaining experience with fine-tuning workflows, experimenting with reinforcement learning techniques, and applying advanced optimization strategies that directly impact product quality. The position also requires working directly with the training code, debugging complex behaviors, and sharpening skills to make cutting-edge models more reliable and effective. The models developed will power critical workloads at some of the world’s leading companies. Microsoft's mission is to empower every person and every organization on the planet to achieve more, fostering a culture of inclusion built on values of respect, integrity, and accountability. This role is targeting an immediate start date.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Number of Employees
5,001-10,000 employees