Data Scientist - Model Optimization

quadric.io
Burlingame, CA
Onsite

About The Position

Quadric has created an innovative general-purpose neural processing unit (GPNPU) architecture. Quadric's co-optimized software and hardware are targeted to run neural network (NN) inference workloads in a wide variety of edge and endpoint devices, ranging from battery-operated smart-sensor systems to high-performance automotive or autonomous vehicle systems. Unlike other NPUs or neural network accelerators in the industry today that can only accelerate a portion of a machine learning graph, the Quadric GPNPU executes both NN graph code and conventional C++ DSP and control code.

Role: You will be joining the data science team focused on model optimization for Quadric's custom GPNPU architecture. You will research, prototype, and implement novel quantization algorithms tailored to our hardware constraints. Beyond applying existing techniques, you'll develop custom low-precision methods that maximize performance on the Chimera GPNPU. Your work will directly shape the quantization capabilities in the Chimera SDK and influence future hardware features.

This California Bay Area-based engineering role is intended to be primarily in-office at our Burlingame location, with the ability to commute regularly. We believe strong technical collaboration, rapid iteration, and shared problem-solving are well supported by working together in person. The team and company also gather periodically for onsite meetings and offsite events to connect, collaborate, and align on priorities.

Requirements

  • M.S./Ph.D. in CS, EE, Applied Math, or similar, with 5+ years in ML model optimization or data-science-driven research.
  • Deep grasp of fixed-point arithmetic, quantization theory, numerical analysis, and statistical calibration.
  • Strong ability to implement quantization algorithms from first principles, not just use existing frameworks.
  • Fluent in Python, PyTorch or TensorFlow, NumPy/Pandas/SciPy, and data-viz tools (Matplotlib/Plotly).
  • Experience implementing custom quantizers and understanding their interaction with hardware constraints (bit-width, format, operations).
  • Hands-on experience with at least one quantization toolkit (PyTorch FX/PTQ/QAT, TensorFlow Lite, ONNX Runtime, TVM, MLIR Quant) and the ability to extend it.
  • Working knowledge of CNNs, Transformers, and DNN architectures.

Nice To Haves

  • Experience with custom hardware accelerators, DSPs, or neural processing units.

Responsibilities

  • Design statistically rigorous experiments to compare PTQ, QAT, and mixed-precision schemes on vision, language, and multimodal models.
  • Implement custom quantization algorithms from scratch, adapting existing techniques or developing novel approaches to match the Chimera GPNPU's unique architectural features and numerical formats.
  • Build calibration datasets; develop Python notebooks/dashboards to track accuracy, latency, power, and memory trade-offs.
  • Perform layer-level error analysis to guide numerical-format choices.
  • Partner with the compiler team to convert your findings into turnkey SDK flows and reference configs.
  • Publish internal white papers and external benchmarks, and present results to customers and at industry events.
  • Monitor academic literature in compression and efficient inference; translate promising ideas into reproducible prototypes.

Benefits

  • Competitive salary and meaningful equity
  • Medical, dental, and vision plan options starting on day one
  • 401(k) retirement plan
  • Flexible paid time off (unlimited, non-accrual) to support work-life balance
  • When working in-office, enjoy company-provided lunches and a stocked kitchen
  • Convenient office location within walking distance of the Caltrain station
  • Support for commuting, including monthly parking or Caltrain passes
  • Downtown Burlingame office location, close to shops, cafes, and local amenities
  • A politics-free, highly collaborative environment where talented people can do their best work and make an immediate impact
  • The opportunity to build long-term career relationships in a company that values strong personal connections alongside professional excellence