The goal of this internship project is to create ETL (extract, transform, & Load) templates in Python for use with the TetraScience database (TDB). We would like to have assay databases for our SEC, CE-SDS (R/NR), and icIEF methods. Extraction templates will need to be created to act as a “trigger” in the TDB for data capture. This will first act as proof of concept for AD Use Cases and additionally will result in the continuous addition of new values to the databases over time. A secondary goal will be to restructure the HCP database created a few years ago for our characterization group. The database currently resides in the RStudio Format, which is not compatible with TetraScience. It will need to be transformed into Python if AD wishes to continue its use. Both goals are designed for accessibility and ease of use and will demonstrate the powerful functionality of the TDB. The intern will design and implement an internal, web-based analytical method database to improve the consistency and accuracy of data interpretation. This will include compiling data from common release methods such as SEC, icIEF, and CE-SDS. On the technical side, the intern will be responsible for developing the database architecture, building a searchable and intuitive web interface, and deploying the solution within the organization’s AWS environment using an appropriate framework such as PHP, ASP.NET, or Python. Success will require close coordination across functions, including collaboration with scientific subject matter experts to validate assay content and with digital/IT partners to ensure performance, integration, and long-term sustainability. The internship will conclude with delivery of a deployed prototype, complete documentation, user guidelines, and a final presentation of results and recommendations to stakeholders.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Intern
Number of Employees
5,001-10,000 employees