Site Reliability Engineer

Modulate
Somerville, MA
Hybrid

About The Position

Modulate is the leader in conversational voice intelligence. We enable enterprises to deeply understand how people communicate and take timely action based on those insights. Our products help detect harm, prevent fraud, and build safer, more trusted online and real-world voice environments.

We are building a Conversation Intelligence Platform: APIs, workflows, and applications that bring voice understanding to customers at enterprise scale.

We’re looking for our first Site Reliability Engineer to own and scale the reliability of our production systems. This role will be critical in ensuring high availability across our Developer APIs platform and Enterprise products, while building the foundations for monitoring, incident response, and operational excellence as we grow.

Your Impact

You will play a foundational role in ensuring Modulate’s systems are reliable, scalable, and enterprise-ready.

  • Own the reliability and availability of Modulate’s production infrastructure
  • Build monitoring, alerting, and incident response systems from the ground up
  • Establish sustainable on-call practices that balance reliability with team health
  • Partner with engineering and leadership to shape infrastructure and reliability strategy
  • Help define how Modulate delivers enterprise-grade uptime and performance

What You Will Do

  • Own and operate production systems supporting Modulate’s APIs and enterprise products
  • Design and implement monitoring, alerting, and observability systems
  • Lead incident response, root cause analysis, and postmortem processes
  • Build and improve on-call rotations and operational workflows
  • Collaborate with engineers to deploy, maintain, and scale distributed systems
  • Partner with leadership on infrastructure decisions, roadmaps, and reliability goals
  • Evaluate and support deployment models including cloud, on-prem, and hybrid environments
  • Continuously improve system performance, resilience, and scalability

Requirements

  • Experience deploying and maintaining production software systems
  • Experience building monitoring and alerting systems for production environments
  • Experience with on-call rotations and incident response
  • Strong experience with AWS, Python, and Linux
  • Familiarity with tools such as CloudWatch, SNS, PagerDuty, or similar technologies
  • Strong debugging, systems thinking, and problem-solving skills
  • Ability to communicate effectively during high-pressure incidents
  • Experience working in fast-paced or early-stage environments

Nice To Haves

  • Experience with AWS services such as EC2, load balancers, RDS, SQS, SES, and CloudWatch
  • Experience with infrastructure-as-code (e.g., Terraform, CloudFormation)
  • Experience supporting high-scale, distributed systems
  • Familiarity with hybrid or on-prem deployment models

Benefits

  • Competitive salary + equity
  • Full health, dental, and vision coverage
  • Flexible PTO, with a strong culture of taking it
  • Weekly team lunches with dietary accommodations
  • Hybrid work: core in-office days with flexible remote options
  • Regular leadership and industry learning sessions
  • Support for career development and continued learning
  • Work-from-anywhere policy of up to 8 weeks
  • A deeply inclusive, human-centered culture
  • HSA, FSA, 15 holidays, professional growth resources