Data & AI Intern

Verdesian BrandCary, NC

About The Position

This role is for a Data & AI Intern who will be responsible for organizing and preparing historical field trial documents (PDFs, PowerPoint files, spreadsheets) for AI-driven analysis, capturing and validating document metadata, and building structured reference tables including product and formulation tables and product name and alias mappings. The intern will also assist with defining and validating standardized trial data schemas, reviewing and quality-checking AI-extracted data, helping flag ambiguous or low-confidence data for expert review, and writing simple scripts (e.g., Python) for data cleaning, validation, and comparison. Documentation of assumptions, data standards, and edge cases to support long-term use is also a key part of the role. The internship offers hands-on experience applying AI to unstructured enterprise data, exposure to real-world data processing and AI workflow design, and an understanding of how R&D, data science, and commercial decision-making connect in industry. Mentorship from experienced field scientists and data practitioners is provided, and the intern will contribute to a portfolio-worthy project with tangible, lasting impact, including a standardized, documented product and alias reference system, a searchable, well-organized trial document repository, and clean, validated datasets ready for AI extraction and analytics. The intern will also contribute to documentation enabling the project to scale beyond the internship.

Requirements

  • Currently pursuing a Bachelor’s or Master’s degree in Computer Science, Data Science, Engineering, or a related field
  • Experience working with structured data (tables, CSV files, basic schemas)
  • Basic proficiency in Python
  • Strong attention to detail and comfort working with messy, real-world data
  • Ability to document work clearly and communicate findings to technical and non-technical teammates

Nice To Haves

  • Familiarity with NLP, machine learning, or large language models (coursework or projects) is a plus
  • Interest in applied AI, data science, agriculture, biology, or environmental science

Responsibilities

  • Organize and prepare historical field trial documents (PDFs, PowerPoint files, spreadsheets) for AI-driven analysis
  • Capture and validate document metadata (crop, location, year, product, file type, etc.)
  • Build and maintain structured reference tables, including: Product and formulation tables
  • Build and maintain structured reference tables, including: Product name and alias mappings (legacy names, internal codes, marketing names)
  • Assist with defining and validating standardized trial data schemas
  • Review and quality-check AI-extracted data from field trial documents
  • Help flag ambiguous or low-confidence data for expert review
  • Write simple scripts (e.g., Python) for data cleaning, validation, and comparison
  • Document assumptions, data standards, and edge cases to support long-term use

Benefits

  • Hands-on experience applying AI to unstructured enterprise data
  • Exposure to real-world data processing and AI workflow design
  • Understanding of how R&D, data science, and commercial decision-making connect in industry
  • Mentorship from experienced field scientists and data practitioners
  • A portfolio-worthy project with tangible, lasting impact
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service