Data & AI Intern

Verdesian Asia•Cary, NC

About The Position

This internship focuses on organizing and preparing historical field trial documents (PDFs, PowerPoint files, spreadsheets) for AI-driven analysis. The intern will be responsible for capturing and validating document metadata, building and maintaining structured reference tables (including product and formulation tables, and product name/alias mappings), and assisting with defining and validating standardized trial data schemas. Key tasks also include reviewing and quality-checking AI-extracted data, flagging ambiguous or low-confidence data for expert review, and writing simple Python scripts for data cleaning, validation, and comparison. The intern will document assumptions, data standards, and edge cases to support long-term project use. The role offers hands-on experience applying AI to unstructured enterprise data, exposure to real-world data processing and AI workflow design, and an understanding of the connection between R&D, data science, and commercial decision-making in industry. Mentorship from experienced field scientists and data practitioners is provided, culminating in a portfolio-worthy project with tangible impact. By the end of the internship, contributions will include a standardized product and alias reference system, a searchable trial document repository, clean and validated datasets ready for AI extraction and analytics, and documentation to enable project scalability.

Requirements

Currently pursuing a Bachelor’s or Master’s degree in Computer Science, Data Science, Engineering, or a related field
Experience working with structured data (tables, CSV files, basic schemas)
Basic proficiency in Python
Strong attention to detail and comfort working with messy, real-world data
Ability to document work clearly and communicate findings to technical and non-technical teammates.
Interest in applied AI, data science, agriculture, biology, or environmental science

Nice To Haves

Familiarity with NLP, machine learning, or large language models (coursework or projects) is a plus.

Responsibilities

Organize and prepare historical field trial documents (PDFs, PowerPoint files, spreadsheets) for AI-driven analysis
Capture and validate document metadata (crop, location, year, product, file type, etc.)
Build and maintain structured reference tables, including: Product and formulation tables, Product name and alias mappings (legacy names, internal codes, marketing names)
Assist with defining and validating standardized trial data schemas
Review and quality-check AI-extracted data from field trial documents
Help flag ambiguous or low-confidence data for expert review
Write simple scripts (e.g., Python) for data cleaning, validation, and comparison
Document assumptions, data standards, and edge cases to support long-term use

Benefits

Hands-on experience applying AI to unstructured enterprise data
Exposure to real-world data processing and AI workflow design
Understanding of how R&D, data science, and commercial decision-making connect in industry
Mentorship from experienced field scientists and data practitioners
A portfolio-worthy project with tangible, lasting impact

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume