Data & AI Intern

Verdesian AsiaCary, NC

About The Position

This internship focuses on organizing and preparing historical field trial documents (PDFs, PowerPoint files, spreadsheets) for AI-driven analysis. The intern will be responsible for capturing and validating document metadata, building and maintaining structured reference tables (including product and formulation tables, and product name/alias mappings), and assisting with defining and validating standardized trial data schemas. Key tasks also include reviewing and quality-checking AI-extracted data, flagging ambiguous or low-confidence data for expert review, and writing simple Python scripts for data cleaning, validation, and comparison. The intern will document assumptions, data standards, and edge cases to support long-term project use. The role offers hands-on experience applying AI to unstructured enterprise data, exposure to real-world data processing and AI workflow design, and an understanding of the connection between R&D, data science, and commercial decision-making in industry. Mentorship from experienced field scientists and data practitioners is provided, culminating in a portfolio-worthy project with tangible impact. By the end of the internship, contributions will include a standardized product and alias reference system, a searchable trial document repository, clean and validated datasets ready for AI extraction and analytics, and documentation to enable project scalability.

Requirements

  • Currently pursuing a Bachelor’s or Master’s degree in Computer Science, Data Science, Engineering, or a related field
  • Experience working with structured data (tables, CSV files, basic schemas)
  • Basic proficiency in Python
  • Strong attention to detail and comfort working with messy, real-world data
  • Ability to document work clearly and communicate findings to technical and non-technical teammates.
  • Interest in applied AI, data science, agriculture, biology, or environmental science

Nice To Haves

  • Familiarity with NLP, machine learning, or large language models (coursework or projects) is a plus.

Responsibilities

  • Organize and prepare historical field trial documents (PDFs, PowerPoint files, spreadsheets) for AI-driven analysis
  • Capture and validate document metadata (crop, location, year, product, file type, etc.)
  • Build and maintain structured reference tables, including: Product and formulation tables, Product name and alias mappings (legacy names, internal codes, marketing names)
  • Assist with defining and validating standardized trial data schemas
  • Review and quality-check AI-extracted data from field trial documents
  • Help flag ambiguous or low-confidence data for expert review
  • Write simple scripts (e.g., Python) for data cleaning, validation, and comparison
  • Document assumptions, data standards, and edge cases to support long-term use

Benefits

  • Hands-on experience applying AI to unstructured enterprise data
  • Exposure to real-world data processing and AI workflow design
  • Understanding of how R&D, data science, and commercial decision-making connect in industry
  • Mentorship from experienced field scientists and data practitioners
  • A portfolio-worthy project with tangible, lasting impact
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service