About The Position

We’re looking for a curious, creative, and practical Data Scientist focused on AI Evaluation and Prompt Engineering to help drive our next generation of AI-powered legal research tools. You’ll apply data science techniques to evaluate and improve large language model (LLM) systems, build data-driven insights, and shape our product roadmap. You’ll work closely with product managers, data engineers, and subject matter experts to make sense of large collections of legal and corporate documents (such as SEC filings). You’ll help us ask better questions, run smarter experiments, and turn data into clear, actionable recommendations.

Requirements

  • 3–6 years of experience in data science, applied NLP, or AI product analytics, preferably within a SaaS or research-heavy product environment.
  • Strong proficiency in Python and data analysis libraries such as Pandas; solid working knowledge of SQL.
  • Ability to design and evaluate LLM-based systems (e.g., retrieval-augmented generation (RAG) pipelines, prompt evaluations, output scoring), even if not specialized in deep learning.
  • Experience with data exploration, experimentation, and reporting — from defining metrics to visualizing and interpreting results.
  • Comfort working with document-based datasets (e.g., text corpora, metadata, embeddings) and understanding information retrieval / semantic search concepts.
  • Excellent written and verbal communication skills — able to present complex ideas simply and persuasively across distributed teams.
  • Proven ability to self-direct, learn new tools and concepts quickly, and apply them pragmatically.
  • Strong sense of curiosity, patience, and collaboration — especially in working across different disciplines and cultures.

Nice To Haves

  • Familiarity with tools like LangChain, OpenAI API, Databricks, or similar LLM/AI development environments.
  • Experience designing evaluation frameworks or product analytics systems for AI-driven products.
  • Prior exposure to legal, financial, or corporate document datasets.
  • Experience mentoring or informally guiding junior technical teammates.

Responsibilities

  • Evaluate and tune LLM-powered features, including prompts, retrieval-augmented generation (RAG) systems, and semantic search.
  • Design and execute experiments to measure model quality, reliability, and user impact — translating technical findings into product recommendations.
  • Develop and maintain data pipelines for evaluating, tracking, and improving system performance (e.g., accuracy, latency, cost, and relevance metrics).
  • Analyze structured and unstructured datasets (e.g., product usage logs, document metadata, LLM outputs) to identify patterns, insights, and areas for optimization.
  • Collaborate with product managers to translate product goals into measurable data science questions, propose next steps, and inform roadmap priorities.
  • Provide technical guidance to data engineers who build and maintain analytics and model evaluation infrastructure.
  • Communicate results clearly — through written reports, dashboards, and presentations — to technical and non-technical stakeholders.
  • Stay current on emerging practices in applied NLP, LLM evaluation, and data-driven product development, and thoughtfully adapt them to our environment.

Benefits

  • Flexible remote-first work environment, with the option to work from our New York office
  • Comprehensive health coverage, including medical, dental, and vision plans
  • Retirement plan with inclusive risk benefits (disability, critical illness, life cover, and funeral cover)
  • Modern family benefits, including adoption, surrogacy, and parental leave
  • Paid study leave and professional development support
  • Well-being initiatives and opportunities for sabbaticals and personal growth
  • A culture that values work/life balance, clear communication, and continuous learning
  • Health Benefits: Comprehensive, multi-carrier program for medical, dental and vision benefits
  • Retirement Benefits: 401(k) with match and an Employee Share Purchase Plan
  • Wellbeing: Wellness platform with incentives, Headspace app subscription, Employee Assistance and Time-off Programs
  • Short- and Long-Term Disability, Life and Accidental Death Insurance, Critical Illness, and Hospital Indemnity
  • Family Benefits, including bonding and family care leaves, adoption and surrogacy benefits
  • Health Savings, Health Care, Dependent Care and Commuter Spending Accounts
  • In addition to annual Paid Time Off, we offer up to two days of paid leave each to participate in Employee Resource Groups and to volunteer with your charity of choice

What This Job Offers

  • Job Type: Full-time
  • Career Level: Mid Level
  • Education Level: No Education Listed
  • Number of Employees: 1,001-5,000
