AI Model Architect

EricssonAustin, TX
Onsite

About The Position

As our Principal AI Model Architect, you occupy the most strategically critical seat in our entire silicon program. You are the living contract between what our researchers dream up and what our silicon team can physically build. You are the person in the room who looks at a state-of-the-art Transformer architecture and answers the question no one else can: "Here's exactly how we break this apart, map it across our heterogeneous ASIC, and run it faster than anyone else on earth — and here's the proof." This isn't model fine-tuning. This isn't prompt engineering. This is deep, architecture-level surgery — partitioning massive parameter models, defining tiling strategies, projecting cycle-accurate performance on silicon that doesn't exist yet, and ensuring the SDK team has a mathematically airtight path to make it all real. Your decisions don't just influence software. They get etched into silicon.

Requirements

  • Deep, battle-tested knowledge of Transformer architectures — not just how they work conceptually, but how every design choice (attention head count, KV-cache sizing, embedding strategies, GQA vs. MQA trade-offs) ripples through a hardware execution profile.
  • Experience in the "Hardware-in-the-Loop" world, thinking about cache line behavior, memory wall bottlenecks between HBM and SRAM, and how SIMD and VLIW execution units reward or punish specific model shapes.
  • Advanced proficiency in JAX (strongly preferred), PyTorch, or TensorFlow — specifically at the export and compilation layer.
  • Comfort with graph capture, XLA compilation, and StableHLO representations.
  • Experience with SystemC, Transaction-Level Modeling, or custom cycle-accurate simulation frameworks.
  • Experience using these tools to validate architectural decisions before silicon is committed.

Nice To Haves

  • Applied AI to real RAN workloads — channel estimation, beamforming, interference management at L1/L2/L3.
  • Understanding why 5G inference isn't just a data center problem in a smaller box.
  • Instinctively understand the journey from a high-level compute graph to a linearized, scheduled, hardware-bound execution sequence.
  • Understanding what gets lost in that translation and how to protect against it.
  • Hands-on experience with complex-valued AI models and the specific challenges they create when mapping to DSP and matrix accelerator hardware.
  • Understanding of fixed-point quantization, dynamic range, numerical stability under precision reduction.

Responsibilities

  • Take bleeding-edge AI and RAN algorithms — Transformers, Grouped Query Attention, Rotary Positional Embeddings — and convert them into precise hardware specifications for the ASIC team and concrete lowering requirements for the SDK team.
  • Define the strategies for how massive, multi-hundred-million parameter models get decomposed and mapped across heterogeneous compute fabrics.
  • Own and maintain the canonical reference implementations in JAX and PyTorch — the undisputed "Source of Truth" that the entire program aligns to.
  • Project performance using cycle-accurate simulators, SystemC models, and your own deep intuition for how model architectures behave under hardware constraints.
  • Work hand-in-hand with the SDK team to ensure that what the researcher intended and what the hardware executes are identical, bit for bit.

Benefits

  • Choice of three medical plan options
  • Dental plan option
  • Company credits in an amount equal to the cost that Ericsson pays toward the cost of their medical and dental premiums for themselves and eligible covered dependents
  • Automatic 3% company contribution to 401(k)
  • Ericsson match $1 for every $1 you put into the 401(k) Plan on the first 3% of your eligible pay, plus 50 cents on every $1 on the next 2% of eligible pay
  • Company credits in an amount equal to the cost of basic life insurance and basic accidental death and dismemberment coverage, as well as short-term and long-term disability coverage
  • Option to participate in Ericsson’s Stock Purchase Plan
  • Minimum of 15 days of accrued vacation
  • Up to 3 personal days per year
  • 11 annual holidays
  • 8 hours of volunteer time
  • 80 hours of sick time annually
  • Up to 16 weeks of paid maternity leave
  • 6 weeks of parental or adoption leave at 100% of pay
  • Financial wellness programs
  • Educational assistance
  • Matching gifts
  • Recognition programs
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service