Quadric has created an innovative general-purpose neural processing unit (GPNPU) architecture. Quadric's co-optimized software and hardware are designed to run neural network (NN) inference workloads in a wide variety of edge and endpoint devices, ranging from battery-operated smart-sensor systems to high-performance automotive and autonomous vehicle systems. Unlike other NPUs or neural network accelerators in the industry today, which can accelerate only a portion of a machine learning graph, the Quadric GPNPU executes both NN graph code and conventional C++ DSP and control code.

As a Software Engineer Intern - Kernels, you will work closely with our senior AI Kernel Engineers to help enable a variety of AI/LLM models to run efficiently on the Quadric platform. This is a hands-on role in which you will dive deep into hardware architecture and optimization techniques. You will gain experience developing, profiling, and optimizing kernel code, directly contributing to the performance of our AI inference stack.

Note: We prefer a candidate who is willing to relocate to the California Bay Area and can regularly collaborate from our Burlingame office.
Career Level
Intern
Education Level
No Education Listed