TT-Fabric Software Engineer

Tenstorrent
Santa Clara, CA
$100,000 - $500,000
Hybrid

About The Position

Tenstorrent leads the industry in cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists has developed a high-performance RISC-V CPU from scratch and shares a passion for AI and a deep desire to build the best AI platform possible. We value collaboration, curiosity, and a commitment to solving hard problems. We are growing our team and looking for contributors of all seniorities.

Tenstorrent is building the fastest and most efficient AI compute clusters in the world. TT-Fabric is the low-level networking software layer that enables thousands of RISC-V and AI processors to operate as a unified distributed system.

This role is hybrid, based out of Santa Clara, CA. We welcome candidates at various experience levels for this role. During the interview process, candidates will be assessed for the appropriate level, and offers will align with that level, which may differ from the one in this posting.

Requirements

  • Strong systems engineer with deep C or C++ experience and comfort working in low-level or bare-metal environments.
  • Passionate about hardware-software interaction, performance tuning, and eliminating inefficiencies at the protocol level.
  • Curious about networking, synchronization, and communication across large clusters.
  • Comfortable reasoning from first principles and challenging industry conventions.
  • Motivated by building infrastructure that directly impacts large-scale AI training and inference performance.

Responsibilities

  • Architect, implement, and maintain TT-Fabric, our low-level networking library powering distributed inference and training.
  • Design scalable communication systems capable of coordinating thousands of AI processors efficiently and reliably.
  • Optimize protocols, synchronization strategies, and data movement to extract maximum hardware performance.
  • Integrate TT-Fabric APIs into the broader programming model in collaboration with AI and hardware teams.
  • Help define the long-term architecture of Tenstorrent’s distributed systems stack.