Compiler Optimization Engineer

Lemurian LabsToronto, ON

About The Position

We're looking for a Graph Optimization Compiler Engineer to own the middle tier of our AI compiler stack — the layer where high-level model graphs are transformed, simplified, and made ready for efficient code generation. You'll design and implement the optimization passes that make the difference between a model that runs and a model that flies. This role sits between our compiler front end and code generation backend. You'll work on graph-level transformations — fusion, layout optimization, dead code elimination, constant folding, and more — with a direct line of sight to the performance outcomes your work produces. If you think in data flow graphs and optimization passes, and you want that thinking to power the next generation of AI infrastructure, we'd love to talk.

Requirements

  • BS degree in Computer Science, Computer Engineering, or equivalent practical experience
  • 4+ years of experience working with compilers, with a focus on intermediate representation design or optimization passes
  • Deep knowledge of graph-level compiler optimization techniques — fusion, tiling, layout transformations, and related methods
  • 4+ years of experience with C/C++
  • Strong written and verbal communication skills; ability to write clear and concise technical documentation

Nice To Haves

  • Master's or PhD in Computer Science, Computer Engineering, or equivalent
  • Experience with polyhedral models or affine analysis for loop and tensor optimization
  • Familiarity with hardware memory hierarchies and how layout decisions impact performance on GPUs or accelerators
  • Experience working with MLIR, XLA, or similar graph-level IR frameworks
  • Experience with ML framework internals — PyTorch eager/compile mode, JAX/XLA, or TensorRT
  • Strong understanding of ML model architectures and their computational patterns (attention, convolution, normalization, etc.)
  • Knowledge of quantization, sparsity, or other model-level optimization techniques
  • Contributions to open-source compiler or ML infrastructure projects

Responsibilities

  • Design, develop, and maintain the graph optimization layer of our heterogeneous AI compiler
  • Implement and extend graph-level transformation passes including operator fusion, layout propagation, dead code elimination, constant folding, and algebraic simplification
  • Define and evolve our intermediate representation (IR) to support new optimization opportunities as ML model architectures advance
  • Analyze performance data to identify optimization gaps and drive measurable improvements in throughput and latency
  • Collaborate with front end and code generation teams to ensure clean IR interfaces and well-structured optimization pipelines
  • Propose and prototype new optimization strategies in response to advances in model design and hardware capabilities
  • Contribute to testing and validation infrastructure to ensure optimization correctness across model types and hardware targets

Benefits

  • Competitive compensation including equity, medical/dental/vision, retirement savings, and wellness benefits
  • equity
  • company bonus opportunities
  • medical, dental, and vision coverage
  • a retirement savings plan
  • supplemental wellness benefits
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service