This project focuses on adapting and optimizing domain decomposition solvers specifically for the Aurora supercomputer architecture. Leveraging the student's expertise in SYCL and finite element tearing and interconnect (FETI) methods, the work will target the performance characteristics of Intel GPUs. The student will profile existing solver bottlenecks and implement optimizations such as tuning memory access patterns to maximize High Bandwidth Memory (HBM) utilization and exploiting multi-tile parallelism within Aurora's nodes. The ultimate goal is to demonstrate efficient strong scaling of the solver on the Aurora supercomputer at ALCF.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Intern
Education Level
No Education Listed