Vapi is seeking a Site Reliability Engineer (SRE) to drive 99.99% call completion. The role is critical because Vapi runs live phone calls, so any instability can mean dropped calls. The SRE will run incident command, own SLOs and error budgets, and build a reliability culture from the ground up. This is a hands-on role: you will ship code (Go or TypeScript) for services that monitor and manage the platform, including auto-remediation, capacity forecasters, and on-call tooling. Key responsibilities include capacity planning, load testing, and KEDA-based autoscaling for Vapi's wscaler and workerpool-cron-scaler.
Job Type: Full-time
Career Level: Senior
Education Level: No Education Listed