LexisNexis Reed Tech-posted 8 days ago
Full-time • Manager
Hybrid • Raleigh, NC
5,001-10,000 employees

Platform Engineering & Site Reliability Engineering (SRE) Manager About Us At LexisNexis Intellectual Property Solutions, our mission is to bring clarity to innovation by delivering better outcomes to the innovation community. Every day, our teams support the development of new technologies and processes that ultimately advance humanity. By empowering innovators to make informed decisions, be more productive, comply with regulations, and achieve superior results, we directly serve those shaping the future of humankind. About the Team As part of our Digital Operations and Infrastructure Technology organization, you’ll lead a distributed team of SRE and DevOps engineers supporting mission-critical applications across our Government Intellectual Property (IP) and commercial divisions . You’ll help modernize how we build, deploy, and operate systems—driving reliability, automation, and observability at scale. About the Role LexisNexis Reed Tech is seeking a PE & Site Reliability Engineering (SRE) Manager to lead and evolve our global infrastructure reliability strategy. This pivotal role combines strategic leadership with hands-on technical direction to ensure secure, scalable, and resilient systems that power our Government Intellectual Property (IP) and commercial divisions. Conditions of Employment: You must be a U.S. citizen to apply for this position. Hybrid Position required to report to one of the following Horsham, PA, Washington, DC (Alexandria, VA) or Raleigh, NC

  • Hire, mentor, and lead a high-performing, globally distributed team of SRE and DevOps engineers.
  • Foster a culture of reliability, blameless postmortems, and continuous improvement.
  • Build and sustain a global SRE community of practice that aligns reliability standards across business units.
  • Drive cross-functional initiatives and influence enterprise-wide engineering practices.
  • Define and implement SRE best practices to improve reliability, scalability, and performance.
  • Establish and monitor key performance indicators (uptime, MTTR, SLO/SLI compliance).
  • Serve as an escalation point for major incidents, ensuring swift resolution and actionable post-incident reviews.
  • Partner with Product, Cloud Infrastructure, Security, and Architecture teams to ensure alignment with enterprise objectives.
  • Collaborate with Cloud Engineering and Architecture to build robust monitoring, alerting, and observability systems.
  • Lead modernization initiatives, including cloud migrations, IaC automation (Terraform, Kubernetes), and CI/CD pipeline improvements.
  • Drive cloud cost efficiency and governance (FinOps).
  • Ensure compliance with ISO 27001, NIST 800-53, and similar security frameworks.
  • Define and implement SLOs, SLIs, and SLAs for AI/ML pipelines, APIs, and model training systems.
  • Partner with AI/ML and Cloud teams to ensure the reliability, observability, and performance of AI workloads.
  • Lead reliability engineering for MLOps — orchestration, IaC, monitoring, and automated scaling.
  • Champion security, compliance, and fault tolerance across emerging AI platforms.
  • Provide clear direction, feedback, and professional growth opportunities for team members.
  • Encourage innovation, continuous learning, and adoption of new reliability and automation techniques.
  • Lead with a global mindset, balancing local autonomy with enterprise alignment.
  • Bachelor’s degree in computer science, Engineering, or related field (advanced degree preferred).
  • Experience as a Sr. SRE, platform engineering, or DevOps, including several years in a global leadership role.
  • Proven experience leading distributed technical teams and aligning cross-functional stakeholders.
  • Strong expertise in Azure and/or AWS, Kubernetes (EKS/AKS), Terraform, and CI/CD tooling.
  • Background in observability, automation, incident management, and service reliability. Experience with AI/ML infrastructure (Databricks, MLflow, MLOps).
  • Cloud & Infrastructure: Azure, AWS (EKS, EC2, S3, RDS, Lambda, Azure VMs, Functions)
  • Infrastructure as Code: Terraform (modules, workspaces, policies), Ansible, ARM/BICEP/HCL, Spacelift
  • Containers & Orchestration: Docker, Kubernetes, Helm, ArgoCD
  • Monitoring & Observability: Datadog, Splunk, Coralogix, CloudWatch, Azure Monitor
  • Automation & Scripting: Python, Bash, PowerShell, TypeScript
  • Security & Networking: Azure Key Vault, HashiCorp Vault, cloud security best practices
  • Programming Familiarity: Java, .NET/C#, SQL, React environments
  • Empathetic and motivational leader who develops technical talent and fosters collaboration.
  • Excellent communicator capable of engaging both technical and business stakeholders.
  • Deep commitment to transparency, reliability culture, and continuous improvement.
  • Familiarity with FinOps, multi-cloud, or large-scale inference environments.
  • Understanding of ISO 27001, NIST 800-53, and NCSC-aligned frameworks.
  • Knowledge of data governance, privacy, or AI model compliance.
  • Comprehensive, multi-carrier health plan benefits
  • Disability insurance
  • Dependent care and commuter spending accounts
  • Life and accident insurance
  • Retirement benefits (salary investment plan/employer stock purchase plan)
  • Modern family benefits, including adoption and surrogacy
  • Wellness platform with incentives, Headspace app subscription, Employee Assistance and Time-off Programs
  • Short-and-Long Term Disability, Life and Accidental Death Insurance, Critical Illness, and Hospital Indemnity
  • Family Benefits, including bonding and family care leaves, adoption and surrogacy benefits
  • Health Savings, Health Care, Dependent Care and Commuter Spending Accounts
  • In addition to annual Paid Time Off, we offer up to two days of paid leave each to participate in Employee Resource Groups and to volunteer with your charity of choice
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service