AI Kernel Engineer (New Grad)

quadric, IncBurlingame, CA
Hybrid

About The Position

Quadric has created an innovative software driven AI inference processor. Licensed as IP, the architecture is targeted to run neural network (NN) inference workloads in a wide variety of edge and endpoint devices, ranging from battery operated smart-sensor systems to high-performance automotive or autonomous vehicle systems. Unlike other neural engines in the industry today that can only accelerate a portion of a machine learning graph, the Quadric GPNPU executes both NN graph code and conventional C++ DSP and control code. The AI Kernel Engineer (New Grad) at Quadric plays a key role in enabling a large number of AI kernels and operators to run efficiently on the Quadric platform. In this role, you will [1] develop a highly efficient Quadric kernel library for a variety of AI/LLM models; [2] analyze performance and optimize kernels for different hardware configurations. This technical role demands a strong foundational knowledge of hardware architecture, software optimization, and the hardware-software interface. This role is based in our Burlingame, California office. We believe strong technical collaboration, rapid iteration, and shared problem-solving are best supported by working together in person. As such, this role follows a hybrid schedule with at least two in-office days per week expected.

Requirements

  • Bachelor's, Master's, or PhD in Computer Science, Electrical Engineering, or a related field.
  • Strong proficiency in C/C++ and Python.
  • Solid foundational understanding of computer architecture and hardware-software interaction.
  • Demonstrated capability in problem solving, debugging, and clear technical communication.
  • Experience with (or academic exposure to) at least one of the following compute development frameworks: CUDA, DSP, NEON, or Triton-lang.
  • Must be willing to relocate to the California Bay Area and work from the Burlingame office on a hybrid basis.

Nice To Haves

  • Familiarity with assembly language or compiler internals is a plus.
  • Experience with model and kernel inference performance profiling is highly desirable.

Responsibilities

  • Develop AI/LLM kernels and operators on the Quadric platform for efficient inference.
  • Optimize kernel performance for different hardware configurations and workloads.
  • Profile and analyze kernel performance in terms of compute, data, and parallelism; identify micro-architecture and software bottlenecks and provide optimization solutions.
  • Optimize kernel C/C++ code to maximize hardware utilization.
  • Collaborate across related areas of the AI inference stack to support team and business priorities.
  • Make improvements to the Quadric toolchain, compiler, and runtime.
  • Provide technical support and documentation to customers and the developer community.

Benefits

  • Competitive salary and meaningful equity
  • Medical, dental, and vision plan options starting on day one
  • 401(k) retirement plan
  • Flexible paid time off (unlimited, non-accrual) to support work-life balance
  • When working in-office, enjoy company-provided lunches and a stocked kitchen
  • Convenient office location within walking distance of the Caltrain station
  • Support for commuting, including monthly parking or Caltrain passes
  • Downtown Burlingame office location, close to shops, cafes, and local amenities
  • A politics-free, highly collaborative environment where talented people can do their best work and make an immediate impact
  • The opportunity to build long-term career relationships in a company that values strong personal connections alongside professional excellence
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service