About The Position

xAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity.

We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All employees are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.

About The Role

You will work on the most critical post-training and reinforcement learning challenges at any given time, including reward modeling, preference optimization (RLHF/DPO), and RL for improving reasoning, truthfulness, and real-world capabilities. You will get clarity on your first project before an offer.

Requirements

  • Belief that truth-seeking AI is the most important and challenging problem.
  • Obsession with building incredibly useful models through post-training and RL techniques.
  • Power user of AI models, eager to push the boundaries of what’s possible with reinforcement learning and alignment methods.
  • Pride in work and thriving in meritocratic environments.
  • Strong communication skills, able to concisely and accurately share knowledge with teammates.

Nice To Haves

  • Previous work on post-training, RLHF, or training models used by millions of people.

Responsibilities

  • Work on critical post-training and reinforcement learning challenges, including reward modeling, preference optimization (RLHF/DPO), and RL for improving reasoning, truthfulness, and real-world capabilities.

Benefits

  • Equity
  • Comprehensive medical coverage
  • Vision coverage
  • Dental coverage
  • Access to a 401(k) retirement plan
  • Short-term disability insurance
  • Long-term disability insurance
  • Life insurance
  • Various other discounts and perks