Research Engineer - Agency and Reasoning

Zyphra
San Francisco, CA
Onsite

About The Position

As a Research Engineer - Agency and Reasoning, you will be a core contributor to Zyphra’s Agency and Reasoning Team. You will conduct novel research in reinforcement learning, post-training, and human preference learning, and apply your ideas at scale to our next generation of language models.

Requirements

  • Strong research taste and intuition
  • The ability to work through a research project from conception to execution to write-up
  • Strong implementation and prototyping skills
  • The ability to take an idea from conception to experimentation extremely quickly
  • The ability to work well and cooperate with others in a fast-paced research setting
  • Curiosity, interest, and joy in understanding intelligence

Nice To Haves

  • Experience and aptitude with reinforcement learning, either in the context of language model reasoning or more classical RL tasks
  • Experience with supervised fine-tuning of language models and preference-learning methods, such as DPO and SimPO
  • Experience with context-length extension methods
  • A good intuitive ability to understand model behaviors and correct them through iterative fine-tuning
  • Interest in working closely with data and spending significant time on data engineering and synthetic data generation
  • Postgraduate degree in a scientific subject (Computer Science, EE/EECS, Mathematics, Physics)
  • Previously published machine learning research in well-respected venues
  • Highly proficient with PyTorch and Python
  • Excitement about, and the ability to, rapidly learn new fields and implement new ideas
  • Excellent communication and collaboration skills, and the ability to work effectively on both research and engineering implementation at scale

Benefits

  • Comprehensive medical, dental, vision, and FSA plans
  • Competitive compensation and 401(k) plan
  • Relocation and immigration support on a case-by-case basis
  • In-office snacks and meals provided
  • Unlimited PTO and company holidays
  • In-person team in San Francisco with a collaborative, high-energy environment