We are seeking a highly motivated undergraduate intern to join the GPU Client AI team and contribute to AI inference performance optimization on Intel GPUs. This is a hands-on, technical internship designed for students who want deep exposure to real-world AI workloads, GPU performance optimization, and systems-level software engineering. This role is ideal for a student who can work part-time (~16 hours/week) for 6 months to 1 year and is interested in building strong foundations in AI software stacks, GPU programming, and performance optimization. As an intern, you will work closely with senior engineers and gradually take ownership of well-scoped technical tasks. This internship provides exposure to AI inference software stacks and how modern models map onto GPU hardware, basics of GPU architecture and parallel programming concepts, performance profiling tools and methodology for data-driven optimization, and real-world engineering workflows: code reviews, design discussions, and cross-team collaboration.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Part-time
Career Level
Intern
Number of Employees
5,001-10,000 employees