This internship focuses on organizing and preparing historical field trial documents (PDFs, PowerPoint files, spreadsheets) for AI-driven analysis. The intern will be responsible for capturing and validating document metadata, building and maintaining structured reference tables (including product and formulation tables, and product name/alias mappings), and assisting with defining and validating standardized trial data schemas. Key tasks also include reviewing and quality-checking AI-extracted data, flagging ambiguous or low-confidence data for expert review, and writing simple Python scripts for data cleaning, validation, and comparison. The intern will document assumptions, data standards, and edge cases to support long-term project use. The role offers hands-on experience applying AI to unstructured enterprise data, exposure to real-world data processing and AI workflow design, and an understanding of the connection between R&D, data science, and commercial decision-making in industry. Mentorship from experienced field scientists and data practitioners is provided, culminating in a portfolio-worthy project with tangible impact. By the end of the internship, contributions will include a standardized product and alias reference system, a searchable trial document repository, clean and validated datasets ready for AI extraction and analytics, and documentation to enable project scalability.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Career Level
Intern
Number of Employees
101-250 employees