Staff Software Engineer, Cloud Infrastructure

Tenstorrent
2h$100,000 - $500,000Hybrid

About The Position

Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists have developed a high performance RISC-V CPU from scratch, and share a passion for AI and a deep desire to build the best AI platform possible. We value collaboration, curiosity, and a commitment to solving hard problems. We are growing our team and looking for contributors of all seniorities. We're looking for a Staff Software Engineer with a platform infrastructure / Site Reliability Engineering (SRE) background to join us to work on infrastructure automation, integration, and operations. Work covers backend development, service integrations, infrastructure-as-code, and site reliability engineering aspects. Adjacent experience up or down the stack is also highly valuable. This role is hybrid OR remote, based out of the United States or Toronto. We welcome candidates at various experience levels for this role. During the interview process, candidates will be assessed for the appropriate level, and offers will align with that level, which may differ from the one in this posting.

Requirements

  • Fluent in Python, Infrastructure-as-Code (Ansible), shell scripting, Linux SysOps, and CI/CD.
  • DevOps mindset with experience in software integrations and operational infrastructure.
  • Experienced in observability, including hardware, system, and application level telemetry, monitoring, and alerting (Prometheus, Loki, Alloy, Grafana, Sentry, SNMP, Redfish, IPMI).
  • Familiarity with Bare Metal, Virtual Machine and Kubernetes provisioning and operations.

Nice To Haves

  • Neocloud / CSP background is a plus.

Responsibilities

  • Hands-on software engineering to push infrastructure and operational excellence further.
  • Effective collaboration with end-users, peers, domain experts, and stakeholders.
  • Leadership to grow teams’ capabilities and eagerness to learn more.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service