Senior Applied Scientist

Hippocratic AIPalo Alto, CA
Onsite

About The Position

LLM post-training is where raw capability becomes reliable, safe behavior — and in healthcare, the stakes are as high as they get. You'll own the RL-based post-training pipeline end-to-end to improve our models' clinical reasoning, safety, and alignment. Your models will be deployed to interact with millions of patients across diverse clinical use cases.

Requirements

  • MS or PhD in CS or relevant field
  • 4+ years or experience in NLP, LLM training, RL, or general ML
  • 1+ years experience in RL for LLM post-training
  • Experience with large-scale (50B+ parameter and multi-node) LLM training
  • Strong Python and PyTorch coding skills
  • Experience with RLHF, RLVR, LLM-as-judge or similar methods for LLM post-training

Nice To Haves

  • Publications at top venues (NeurIPS, ICML, ICLR, ACL, EMNLP)
  • Healthcare domain experience

Responsibilities

  • Design and iterate on RL-based post-training methods (RLHF, RLVR, DPO, and beyond)
  • Build and evaluate reward models, verifiers, and LLM-as-judge pipelines
  • Develop conversational AI environments and simulations for healthcare RL training with synthetic data
  • Run rigorous experiments to understand what drives post-training gains
  • Collaborate with research, engineering, and clinical teams
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service