This internship will extend the agentic scientific workflow framework (developed in a previous SULI internship) to execute workflows on HPC systems. Using the Academy middleware for building and deploying stateful agents across distributed systems, the intern will design and implement HPC-aware Academy agents capable of submitting, managing, and coordinating batch jobs on LCRC Improv/Bebop clusters. The goal is to enable the agentic workflow framework to autonomously install software, submit jobs, monitor execution, handle failures, and collect results on HPC machines.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Intern
Education Level
No Education Listed