Quadric has created an innovative software driven AI inference processor. Licensed as IP, the architecture is targeted to run neural network (NN) inference workloads in a wide variety of edge and endpoint devices, ranging from battery operated smart-sensor systems to high-performance automotive or autonomous vehicle systems. Unlike other neural engines in the industry today that can only accelerate a portion of a machine learning graph, the Quadric GPNPU executes both NN graph code and conventional C++ DSP and control code. As a new-grad Deep Learning Compiler Engineer, you will work on CGC, Quadric's neural network compiler that lowers ONNX models through Relay IR down to C++ targeting the Chimera GPNPU. You will own work in real compiler passes — layout selection, memory allocation, operator splitting, code generation — and your changes will ship into the code that runs on Quadric silicon. This is a hands-on engineering role on a small, senior team. You will design IR transformations, debug the C++ the compiler emits, and drive how efficiently neural networks map to our hardware. The ramp is steep, the codebase is large, and the feedback loop from your changes to running silicon is short. Note: We strongly prefer candidates willing to relocate to the California Bay Area and work from our Burlingame office.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Entry Level