Research Scientist - Agency and Reasoning

ZyphraPalo Alto, CA
3dOnsite

About The Position

Zyphra is an artificial intelligence company based in Palo Alto, California. The Role: As a Research Scientist , you will be a core contributor to Zyphra’s Agency and Reasoning Team. You will be involved with performing novel research in reinforcement learning, post-training, and human preference learning, and applying your ideas at scale to our next generation of language models. What We’re Looking For: Strong research taste and intuition The ability to work through a research project from conception to execution to write-up Strong implementation and prototyping skillset A researcher who can take an idea from conception to experimentation extremely quickly The ability to work well and cooperate with others in a high-paced research setting Curiosity, interest, and joy in understanding intelligence.

Requirements

  • Experience and aptitude with reinforcement learning, either in the context of language model reasoning or more classical RL tasks
  • Experience with language model supervised finetuning and preference learning methods such as DPO, simPO, etc.
  • Experience with context-length extension methods
  • A good intuitive ability to understand model behaviors and correct them through iterative fine-tuning
  • Interest in grappling in detail with data and spending significant time involved in data engineering and synthetic data generation
  • Postgraduate degree in a scientific subject (Computer Science, EE/EECS, Mathematics, Physics)
  • Previously published machine learning research in well-respected venues
  • Highly proficient with PyTorch and Python
  • We are excited and able to rapidly learn new fields and implement new ideas
  • Excellent communication and collaboration skills, and can work effectively on both research and engineering implementation at scale

Benefits

  • Comprehensive medical, dental, vision, and FSA plans
  • Competitive compensation and 401(k)
  • Relocation and immigration support on a case-by-case basis
  • On-site meals prepared by a dedicated culinary team; Thursday Happy Hours
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service