About The Position

LILT is building a global network of domain experts to support high-quality AI evaluation across training, benchmarking, red-teaming, and ongoing model monitoring. We are seeking education and learning professionals to contribute expert judgment to human-in-the-loop AI evaluation workflows used by leading enterprises and hyperscalers.

This role is designed for professionals who understand how educational content, learning experiences, and instructional systems work in real-world academic and professional learning environments, and who can apply that expertise to evaluate, assess, and improve multilingual AI systems. Your expertise will directly influence multilingual AI model quality, safety, and deployment readiness.

This role includes two distinct expert tracks, based on experience level and scope of responsibility.

Track A: EdTech AI Rater
Raters execute structured evaluation tasks using clearly defined rubrics and instructions.

Track B: EdTech AI Evaluator (Senior Track)
Evaluators provide higher-level domain oversight and help shape how evaluation is performed.

AI is changing how the world communicates, and LILT is leading that transformation. LILT's mission is to make the world's information available to everyone, no matter the language they speak. Join our global community of people who thrive on innovation and excellence. Our collective knowledge, uniqueness, and skills deliver multilingual AI and human-verified services to enterprises, governments, and AI developers around the world.

Earn money. Have fun. Advance human knowledge. Work on diverse projects from anywhere, any time you want. Get paid quickly and fairly, and build your professional network in a supportive community, all through a streamlined application process tailored to your expertise.

Requirements

  • Educators, instructional designers, curriculum developers, or learning professionals
  • Experience with teaching, curriculum design, assessment, or educational technology
  • Strong attention to detail and comfort working with structured evaluation criteria
  • Senior educators, academic leaders, learning scientists, or education subject matter experts
  • Experience defining instructional standards, reviewing complex edge cases, or advising on learning outcomes
  • Ability to clearly explain nuanced pedagogical reasoning and tradeoffs
  • Deep domain expertise in education, instructional design, or learning sciences
  • Strong judgment and ability to apply criteria consistently
  • Comfort working with structured evaluation workflows
  • Ability to explain reasoning clearly, especially in instructional or learner-facing scenarios
  • Reliability, professionalism, and respect for quality standards
  • Native or professional fluency in one or more supported languages is required
  • Supported languages span 30+ global languages (list provided during screening)
  • English fluency is required for guidelines, feedback, and collaboration

Responsibilities

  • Evaluate AI outputs related to educational, instructional, and learning content
  • Perform structured scoring, comparison, classification, and judgment tasks
  • Assess pedagogical accuracy, clarity, appropriateness, and learning effectiveness
  • Identify hallucinations, misleading explanations, factual errors, or unsafe educational guidance
  • Apply domain-specific education and instructional guidelines consistently across tasks
  • Validate and refine evaluation rubrics and edge-case handling
  • Perform adjudication where raters disagree
  • Conduct error analysis and qualitative reviews of model behavior
  • Partner with LILT research, product, and customer teams on evaluation design
  • Support red-teaming, educational quality review, and model readiness assessments