Cybersecurity SRE Platform Manager

Wells Fargo & Company
Columbus, OH
$119,000 - $187,000
Hybrid

About The Position

Wells Fargo is seeking a Cybersecurity Platform SRE Manager to take ownership of the reliability, resilience, and operational excellence of mission‑critical cybersecurity platforms supporting a 24×7 global enterprise. This is a leadership role for someone who cares deeply about system health, takes accountability for outcomes, and enjoys solving complex operational challenges at scale.

You will lead a team of Site Reliability Engineers responsible for keeping core security services stable, performant, and secure under real‑world conditions. The work spans incident response, automation, observability, and continuous improvement, with a strong emphasis on building durable systems and proactively reducing risk. You will partner closely with Cybersecurity, Cloud, Infrastructure, and application teams to set reliability standards, strengthen platform resilience, and drive operational discipline across the service lifecycle.

This role is ideal for a leader with experience in site reliability engineering or large‑scale production environments who thrives on ownership, values clear accountability, and is energized by improving how critical security platforms are built, operated, and supported at enterprise scale.

Requirements

  • 5+ years of Information Security Engineering experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
  • 2+ years of Leadership experience
  • Experience operating enterprise‑scale cybersecurity or critical infrastructure platforms in a 24×7 environment
  • Working knowledge of SRE principles including SLIs/SLOs, error budgets, incident response, and post‑incident reviews
  • Hands‑on experience with cloud platforms such as AWS, Azure, and/or GCP
  • Familiarity with automation, CI/CD pipelines, and infrastructure‑as‑code tools such as Terraform, ARM, or CloudFormation
  • Experience supporting change management, audit activities, and operational controls in regulated environments
  • Strong communication and collaboration skills across engineering, security, and risk teams

Nice To Haves

  • Exposure to cybersecurity platforms such as identity services, secrets management, authentication systems, and endpoint or infrastructure security tooling
  • Experience with containerized environments and orchestration platforms such as Kubernetes
  • Familiarity with observability tools including Prometheus, Grafana, Splunk, Elastic, or OpenTelemetry
  • Scripting or programming experience in Python, Bash, PowerShell, or Go
  • Knowledge of Zero Trust concepts, strong authentication, and secure service design
  • Experience in financial services or other highly regulated industries
  • Relevant certifications such as CISSP or CISM, or cloud, DevOps, or SRE certifications

Responsibilities

  • Own cybersecurity platform reliability: Define and track SLIs/SLOs, capacity plans, and reliability improvements to ensure availability, performance, and resilience across cybersecurity platforms such as identity services, secrets management, authentication services, and security tooling.
  • Lead a 24×7 operational model: Manage on‑call rotations, incident response, and escalation processes; participate in Major Incident Management and ensure timely communication and follow‑through.
  • Engineer for resiliency: Guide the team in failure mode analysis, resiliency testing, high‑availability architectures, backup and recovery validation, and disaster recovery exercises.
  • Drive automation and operations as code: Promote CI/CD, infrastructure as code, automated health checks, configuration management, and self‑service tooling to reduce operational toil.
  • Strengthen observability: Establish consistent logging, metrics, dashboards, alerting, and service health reporting using modern observability platforms.
  • Manage risk and compliance obligations: Partner with Risk and Compliance teams to support audit readiness, change governance, access controls, and regulatory expectations such as SOX, FFIEC, GLBA, and PCI, where applicable.
  • Lead and develop engineers: Manage, coach, and grow a team of SRE and platform engineers; set clear goals, support skill development, and foster a blameless, learning‑oriented culture.
  • Partner across the organization: Collaborate with platform owners, security architects, cloud teams, and application teams to align reliability priorities and deliver stable, scalable cybersecurity services.
  • Report on outcomes: Communicate operational health, incidents, risks, and improvement initiatives through clear metrics and leadership‑ready updates.

Benefits

  • Health benefits
  • 401(k) Plan
  • Paid time off
  • Disability benefits
  • Life insurance, critical illness insurance, and accident insurance
  • Parental leave
  • Critical caregiving leave
  • Discounts and savings
  • Commuter benefits
  • Tuition reimbursement
  • Scholarships for dependent children
  • Adoption reimbursement

What This Job Offers

  • Job Type: Full-time
  • Career Level: Manager
  • Education Level: No Education Listed
  • Number of Employees: 5,001-10,000 employees
