Systems Operations Senior Manager - SRE

Wells FargoCharlotte, NC
Hybrid

About The Position

The Systems Operations Senior Manager (SRE) is a people‑first leader responsible for building, developing, and scaling high‑performing operations teams supporting modern data, analytics, BI, and AI platforms. This role fosters a culture of reliability engineering, accountability, and continuous improvement while coaching managers and engineers to deliver consistent, high‑quality operational outcomes. Through strong stakeholder engagement and clear operating rhythms, this leader aligns teams around shared objectives, transparent communication, and measurable impact.

Requirements

  • 7+ years of experience in Systems Engineering and Technology Architecture, or equivalent demonstrated through work experience, training, military experience, or education.
  • 3+ years of management or leadership experience
  • Demonstrated experience with Site Reliability Engineering (SRE) practices, including reliability, observability, error budgets, and toil reduction
  • Hands‑on experience using automation to support SRE‑driven operations, including automation frameworks, orchestration tools, cloud platforms, and CI/CD pipelines
  • Familiarity with modern UI frameworks, automated testing tools, service management platforms, and platform telemetry solutions
  • Proven ability to influence across organizational boundaries and communicate effectively with both technical and non-technical stakeholders.

Nice To Haves

  • Drive platform modernization including cloud readiness, operational tooling, and intelligent automation.
  • Reduce manual toil through workflow automation, standardized operating models, and self-service platforms to improve service predictability.
  • Balance rapid experimentation with enterprise‑grade controls, compliance, and risk management.
  • Serve as a role model for strong leadership behaviors and ethical AI practices.

Responsibilities

  • Manage, coach, and develop teams of analysts, associates, and less experienced managers providing technical services and platform operations support.
  • Build leadership capability and guide a culture of accountability, continuous improvement, and talent development aligned to business strategy.
  • Manage allocation of people and financial resources to meet operational and strategic objectives.
  • Collaborate with and influence professionals at all levels across technology and business organizations.
  • Apply Site Reliability Engineering (SRE) principles as the primary operating model across modern data, analytics, BI, and AI platforms.
  • Implement, enable, lead, and scale SRE operations to ensure platform reliability, performance, scalability, and cost efficiency.
  • Leverage automation to eliminate manual toil, improve operational consistency, and enable predictable, repeatable service delivery.
  • Drive automation across monitoring, incident response, maintenance, and self‑service operational capabilities in support of SRE objectives.
  • Partner with data engineering, platform engineering, and application teams to strengthen reliability, observability, and operational maturity.
  • Identify and recommend opportunities to enhance remote monitoring, management tooling, and periodic system reviews.
  • Define, track, measure, and report operational OKRs, platform KPIs, and service health metrics aligned to business outcomes.
  • Use operational data, trends, and insights to prioritize improvements, reduce friction, and increase platform adoption and trust.
  • Establish a disciplined, metrics‑driven operating rhythm including service health reviews and continuous improvement cycles.
  • Engage and influence stakeholders, internal partners, and peers to deliver platform enhancements and operational improvements.
  • Ensure proactive, transparent communication regarding incidents, service performance, planned changes, and outages.
  • Determine appropriate operational strategies to meet moderate to high‑risk deliverables.
  • Interpret, develop, and implement policies and procedures aligned to compliance, risk, and control requirements.
  • Perform network assessments, security audits, and system enhancement consultations.
  • Provide implementation support for key enterprise risk initiatives.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Manager

Education Level

No Education Listed

Number of Employees

5,001-10,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service