Site Reliability Engineer - AI Cloud

Super Micro Computer
San Jose, CA
245d
$145,000 - $165,000

This job is no longer available

About The Position

As a Cloud Reliability Engineer for our Linux-based AI cloud platforms, you will help us deploy and scale those platforms while ensuring high availability, performance, and security across GPU-accelerated compute clusters, Kubernetes workloads, and the supporting storage and network infrastructure. You’ll bridge Dev and Ops by automating infrastructure deployment, enhancing observability, and applying SRE best practices to support reliable AI and MLOps environments.