Staff Site Reliability Engineer

Finalsite
Glastonbury, CT
Remote

About The Position

The Staff Site Reliability Engineer will lead the evolution of Finalsite’s infrastructure, reliability, and observability practices across a multi-cloud environment. This role partners closely with engineering leadership to improve CI/CD, environment consistency, scalability, and operational excellence across critical platform initiatives.

Requirements

  • At least 10 years of experience as a Staff SRE or senior-level infrastructure engineer supporting production systems at scale
  • Strong expertise in AWS and GCP cloud platforms
  • Familiarity with queue-based or event-driven architectures and autoscaling technologies
  • Experience designing and improving CI/CD infrastructure and deployment automation
  • Hands-on Kubernetes operations experience, including scalability and workload reliability
  • Experience with observability tools, monitoring strategies, and incident management practices
  • Proficiency with Infrastructure-as-Code tools such as Terraform or comparable technologies
  • Strong communication and collaboration skills across engineering teams
  • Experience using AI-assisted development tools such as Claude Code, Codex, or similar technologies

Nice To Haves

  • Experience supporting platform modernization or large-scale migration initiatives
  • Experience supporting multi-tenant SaaS environments
  • Background leading or advancing modern SRE practices within infrastructure teams
  • Familiarity with Ruby/Rails or Python deployment environments
  • Experience within EdTech, SaaS, or other highly available production environments

Responsibilities

  • Design, improve, and maintain scalable CI/CD pipelines and deployment processes
  • Establish reliable staging and development environments aligned with production standards
  • Build and manage observability practices, including monitoring, alerting, dashboards, and SLO frameworks
  • Partner with engineering teams to support platform modernization and infrastructure reliability
  • Drive Infrastructure-as-Code (IaC) standards and multi-cloud operational consistency across AWS and GCP
  • Provide technical leadership on infrastructure architecture, reliability, and operational best practices
  • Support incident response, system reliability, and operational readiness initiatives
  • Utilize AI-assisted development tools to support infrastructure analysis and improvement efforts

Benefits

  • Equal opportunity workplace
  • Affirmative action employer
  • Commitment to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, or veteran status
  • Consideration of qualified applicants regardless of criminal history, consistent with legal requirements
  • Reasonable accommodation for persons with disabilities or special needs