About The Position

Filevine is a Legal AI company delivering Legal Operating Intelligence for the future of legal work. Grounded in a singular system of truth, Filevine brings together data, documents, workflows, and teams into one unified platform—where modern legal work happens with clarity and consistency. Powered by LOIS, the Legal Operating Intelligence System, Filevine connects context across every matter to transform legal operations from reactive to proactive. LOIS reads, understands, and reasons across your data to surface insight, automate complexity, and give professionals the clarity and confidence to see more, know more, and do more. Fueled by a team of exceptional collaborators and innovators, Filevine’s rapid growth has earned AI awards and recognition from Deloitte and Inc. as one of the most innovative and fastest-growing technology companies in the country. Role Summary: Filevine is hiring a VP of Engineering, Reliability to lead one of the most critical functions in our engineering organization. This leader will own the strategy, people, operations, and outcomes for the teams responsible for infrastructure, site reliability, database engineering, observability, and incident management across Filevine's platform. This is not a maintenance role. We are looking for a leader who will assess our current reliability posture and operating model with fresh eyes, define a forward-looking vision for how reliability engineering should work at Filevine, and execute that vision at the pace of an AI business. The right candidate has led Reliability organizations through similar inflection points, brings strong convictions about what "good" looks like at scale, and has the operational credibility and executive presence to drive meaningful change.

Requirements

  • Extensive Leadership: 15+ years of engineering experience, with 7+ years specifically leading infrastructure, reliability, or platform teams at scale in product-driven companies.
  • Organizational Scale: Proven track record managing organizations of 40+ engineers across SRE, DevOps, and Tooling, including developing multiple layers of management.
  • Strategic Evolution: Demonstrated experience evolving reliability operating models to meet the shifting needs of a scaling business.
  • High-Trust Environments: Deep expertise operating in regulated sectors (Legal Tech, Fintech, Gov, or Healthcare) where compliance and data sensitivity are primary constraints.
  • SRE Mastery: Practical, production-hardened understanding of SRE principles, including SLOs, error budgets, toil reduction, and incident management.
  • Cloud-Native Fluency: Strong technical command of AWS, container orchestration, Terraform (IaC), CI/CD, and modern observability stacks.
  • Financial & Resource Stewardship: Direct experience owning cloud infrastructure budgets and successfully driving meaningful cost optimization and forecasting.
  • AI/ML Infrastructure: Familiarity with the reliability requirements for modern AI workloads, such as model serving, vector search, and data pipeline integrity.
  • Executive Presence: Ability to engage the C-suite on risk trade-offs and transformation progress with a "builder mentality" that thrives on solving complex, high-stakes problems.

Responsibilities

  • Strategic Vision: Define and execute the reliability engineering roadmap, aligning infrastructure and AI-native architecture with Filevine’s enterprise growth and platform modernization.
  • Operating Model Evolution: Balance centralized platform capabilities with distributed ownership, ensuring the reliability model scales across a diversifying technology portfolio.
  • Performance Frameworks: Establish and manage SLO/SLI/error budget frameworks to create a shared language for balancing feature velocity with system stability.
  • Efficiency & Planning: Lead infrastructure cost management (optimization and forecasting), capacity planning, and disaster recovery to meet rigorous enterprise contractual commitments.
  • Organizational Development: Lead and scale a multi-disciplinary organization (DevOps, SRE, DBRE, Tooling), fostering a culture of ownership, high craftsmanship, and clear career growth.
  • Operational Excellence: Drive continuous improvement through DORA metrics, incident trend analysis, and systematic toil reduction to enhance service availability and deployment health.
  • Developer Empowerment: Delivery of self-service tooling, guardrails, and documentation that allow feature teams to operate their own services effectively without bottlenecks.
  • Security & Compliance: Act as the primary engineering interface for the CISO to advance compliance posture (FedRAMP, SOC 2, CJIS, ISO) and translate security needs into pragmatic action.
  • Executive Partnership: Collaborate with the CTO, CPO, and Architect to communicate risks and investment needs, positioning reliability as a key enabler for enterprise go-to-market success.

Benefits

  • A dynamic, rapidly growing company, focused on helping organizations thrive
  • Medical, Dental, & Vision Insurance (for full-time employees)
  • Competitive & Fair Pay
  • Maternity & paternity leave (for full-time employees)
  • Short & long-term disability
  • Opportunity to learn from a dedicated leadership team
  • Top-of-the-line company swag
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service