Site Reliability Engineer

LiteraRaleigh, NC
Hybrid

About The Position

Join the Legal Tech Revolution at Litera. Are you ready to shape the future of how law is practiced? At Litera, we’re on a mission to Raise The Bar™️ for the legal profession by delivering transformational and globally-trusted solutions to law firms and corporate legal teams worldwide. We’ve been a leader in legal tech innovation for 30 years and are leading the legal AI revolution to this day with most of the world’s largest law firms as our clients. If you’re passionate about building AI-forward solutions that scale globally and want your work to impact millions of legal professionals worldwide, this is your opportunity to be part of an extraordinary team that’s elevating the craft of law. This position is hybrid based in Denver, Colorado or Raleigh, North Carolina and candidates should reside within reasonable commuting distance, as this role requires on-site presence at least three days per week. Role Overview: As a Site Reliability Engineer (SRE) at Litera, you will play a key role in ensuring our SaaS solutions remain stable, scalable, and resilient. Your primary focus will be on enhancing operational efficiency and reliability across diverse cloud environments and their core infrastructure. You’ll leverage automation to streamline workflows, develop innovative tools, optimize monitoring and alerting systems, and provide rapid incident response. This is a hands-on, team-oriented position that values collaboration and proactive problem-solving.

Requirements

  • 5 Years of experience as an SRE
  • Experience performing log analysis and software remediation
  • Experience using configuration management tools (Terraform, Puppet, Ansible)
  • Experience working with cloud platforms (Azure/AWS) and remote collocated systems
  • Familiarity with AI-driven tools such as Claude AI, GitHub Copilot, Cursor, or Devin AI
  • Deep knowledge in monitoring and alerting tools (New Relic, Datadog, Dynatrace)
  • Confidence in navigating and troubleshooting operating systems (Windows, Linux)
  • Communicates clearly and effectively to drive alignment across teams
  • Collaborates cross-functionally to deliver results and move work forward
  • Strong sense of urgency when it matters

Nice To Haves

  • Knowledge of the mainstream databases (writing queries, tuning)
  • Experience designing and troubleshooting large-scale distributed systems
  • Strong grasp of software engineering principles and agile practices
  • Knowledge of how distributed systems scale, fail, and recover
  • Capable of recommending improvements and rallying others to implement them
  • The ability to communicate effectively and break down complex issues
  • The ability to learn quickly and navigate ambiguity comfortably
  • The ability to remain calm and focused under pressure
  • Experience working in regulated environments such as GDPR, SOX, HIPAA, PCI
  • Certifications in AWS, Azure, or other relevant cloud platforms
  • Hands-on software development experience
  • Core background in networking

Responsibilities

  • Own SaaS application reliability and architecture
  • Solve complex systems issues
  • Support and escalated customer-facing issues
  • Participate in a 24/7 on-call rotation
  • Lead root cause analysis and remediation efforts
  • Automate workflows to improve efficiency
  • Build and maintain dashboards for real-time monitoring
  • Drive reliability improvements across cloud infrastructure
  • Enforce security and compliance standards
  • Coordinate cross-team incident resolution
  • Manage disaster recovery and operational playbooks
  • Mentor engineers and foster technical excellence
  • Be a reliable, collaborative presence on the team

Benefits

  • health insurance
  • retirement savings plans
  • generous paid time off
  • supportive work-life balance
  • health, dental, and vision insurance
  • 401(k) with company contribution
  • incentive and recognition programs
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service