Staff Site Reliability Engineer

Finalsite•Glastonbury, CT

Remote

About The Position

The Staff Site Reliability Engineer will lead the evolution of Finalsite’s infrastructure, reliability, and observability practices across a multi-cloud environment. This role partners closely with engineering leadership to improve CI/CD, environment consistency, scalability, and operational excellence across critical platform initiatives.

Requirements

At least 10 years of experience as a Staff SRE or senior-level infrastructure engineer supporting production systems at scale
Strong expertise in AWS and GCP cloud platforms
Familiarity with queue-based or event-driven architectures and autoscaling technologies
Experience designing and improving CI/CD infrastructure and deployment automation
Hands-on Kubernetes operations experience, including scalability and workload reliability
Experience with observability tools, monitoring strategies, and incident management practices
Proficiency with Infrastructure-as-Code tools such as Terraform or comparable technologies
Strong communication and collaboration skills across engineering teams
Experience using AI-assisted development tools such as Claude Code, Codex, or similar technologies

Nice To Haves

Experience supporting platform modernization or large-scale migration initiatives
Experience supporting multi-tenant SaaS environments
Background leading or advancing modern SRE practices within infrastructure teams
Familiarity with Ruby/Rails or Python deployment environments
Experience within EdTech, SaaS, or other highly available production environments

Responsibilities

Design, improve, and maintain scalable CI/CD pipelines and deployment processes
Establish reliable staging and development environments aligned with production standards
Build and manage observability practices, including monitoring, alerting, dashboards, and SLO frameworks
Partner with engineering teams to support platform modernization and infrastructure reliability
Drive Infrastructure-as-Code (IaC) standards and multi-cloud operational consistency across AWS and GCP
Provide technical leadership on infrastructure architecture, reliability, and operational best practices
Support incident response, system reliability, and operational readiness initiatives
Use AI-assisted development tools to support infrastructure analysis and improvement efforts

Benefits

Equal opportunity workplace
Affirmative action employer
Commitment to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, or veteran status
Consideration of qualified applicants regardless of criminal histories, consistent with legal requirements
Reasonable accommodation for persons with disabilities or special needs
