Senior Applied Scientist

Hippocratic AI•Palo Alto, CA

50d•Onsite

About The Position

LLM post-training is where raw capability becomes reliable, safe behavior — and in healthcare, the stakes are as high as they get. You'll own the RL-based post-training pipeline end-to-end to improve our models' clinical reasoning, safety, and alignment. Your models will be deployed to interact with millions of patients across diverse clinical use cases.

Requirements

MS or PhD in CS or a related field
4+ years of experience in NLP, LLM training, RL, or general ML
1+ years of experience in RL for LLM post-training
Experience with large-scale (50B+ parameter and multi-node) LLM training
Strong Python and PyTorch coding skills
Experience with RLHF, RLVR, LLM-as-judge, or similar methods for LLM post-training

Nice To Haves

Publications at top venues (NeurIPS, ICML, ICLR, ACL, EMNLP)
Healthcare domain experience

Responsibilities

Design and iterate on RL-based post-training methods (RLHF, RLVR, DPO, and beyond)
Build and evaluate reward models, verifiers, and LLM-as-judge pipelines
Develop conversational AI environments and simulations for healthcare RL training with synthetic data
Run rigorous experiments to understand what drives post-training gains
Collaborate with research, engineering, and clinical teams
