Director, Research - Evaluation & Training

Snorkel AI, San Francisco, CA

About The Position

Snorkel AI is seeking a Director, Research - Evaluation & Training to lead a team of researchers focused on data evaluation, error analysis, and data valuation methods to predict model performance. This role is crucial for demonstrating the value and quality of Snorkel’s data for AI model training and evaluation. The team will analyze where current frontier models fall short, identify capability and skill gaps, and translate this understanding into recommendations for benchmarks and datasets. The position is responsible for driving Snorkel’s data design flywheel by analyzing model failures, suggesting areas for data investment, and proving the value of this data to customers.

Requirements

  • 7+ years in applied AI, ML, or research roles, with 4+ years managing technical teams.
  • A leader who has repeatedly turned research and analysis into business outcomes, and who instinctively connects technical findings to market and customer needs.
  • Strong business and market judgment in the AI/ML space: you understand the competitive and frontier-lab landscape and can prioritize accordingly.
  • Technically conversant and credible: enough depth in LLM evaluation, benchmarking, and model behavior analysis to set direction, judge experimental quality, and pressure-test results — without needing to be the deepest technical expert in the room.
  • A nose for trends: able to look across many evaluation results and failure cases and extract the signal that should drive what gets built next.
  • Excellent communication and storytelling skills, with the ability to make technical results legible and persuasive to non-research audiences.

Nice To Haves

  • Familiarity with data valuation or data attribution research is a strong plus.
  • Experience working with frontier labs, public benchmarks, or commercial AI data/eval products.

Responsibilities

  • Own a multi-quarter roadmap centered on novel evaluation, error analysis, and data valuation techniques.
  • Synthesize trends from model-failure analysis and benchmarking into recommendations on which datasets the community should focus on and which ones Snorkel should invest in, making this team a primary input to the company's data strategy.
  • Focus on data valuation techniques that quantify how Snorkel data meaningfully improves model performance.
  • Lead and grow a team of researchers, setting a high bar for quality, rigor, and speed of execution.
  • Act as the primary bridge between the team's findings and Product, GTM, and our customers.

Benefits

  • Meaningful opportunities to shape priorities and initiatives
  • Influence key strategic decisions
  • Directly impact ongoing success
  • Support for deepening technical expertise
  • Support for exploring leadership opportunities
  • Support for learning new skills across multiple functions
  • Environment designed for growth, learning, and shared success
  • Equal employment opportunities
  • Reasonable accommodation for individuals with disabilities