Wells Fargo is seeking an Observability and Automation Leader to support and provide technology services for the Chief Operating Office Technology (COO Tech) Organization. The Chief Operating Office Technology (COO Tech) organization powers the technology behind some of the company’s most critical enterprise functions—from operational resilience and strategic execution to data, customer experience, supply chain, and shared services. Our mission is to modernize and optimize the platforms that enable the business to operate smoothly, scale confidently, and stay future‑ready. Within COO Technology, the Platform & Application Services team is building the next generation of intelligent, resilient systems—and we’re looking for a Systems Operations Senior Manager (Observability and Automation Leader) to help lead that transformation. In this role, you’ll lead a globally distributed team of engineers and play a pivotal role in shifting the organization from reactive firefighting to proactive, predictive system operations . You’ll champion modern observability practices, introduce advanced tools and automation, and drive the cultural evolution needed to achieve deep system insights and operational excellence at scale. This is an opportunity for a hands‑on, strategic leader who thrives at the intersection of technology, people, and transformation—and who wants to leave a lasting mark on how enterprise systems are built and run. In this role, you will: Develop and implement a maturity model for observability, standardizing data collection and centralizing system telemetry to enhance root cause analysis and proactive system management. Integrate observability tools with incident management and CI/CD pipelines, automating alert tuning and remediation using AI and machine learning for real-time anomaly detection. Define, monitor, and align Service Level Indicators (SLIs) and Service Level Objectives (SLOs) with business outcomes, promoting business observability and user experience insights. Implement distributed tracing to optimize service interactions, especially within microservices architectures, and drive the adoption of observability as code (OaC) for consistency and automation. Continuously optimize system performance and reliability with minimal manual intervention, leveraging automation and predictive management practices. Manage and develop high-performing, technical teams, fostering a culture of talent development, and ensuring effective communication with customers regarding incidents and system changes. Engage and collaborate with stakeholders to engineer projects, identify and implement new solutions, and support key risk initiatives. Oversee network assessments, security audits, system enhancements, and ensure compliance with risk management policies and procedures. Manage allocation of people and financial resources for Systems Operations, staying updated on emerging technologies and best practices in observability, automation, and AIOps.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Manager
Education Level
No Education Listed
Number of Employees
5,001-10,000 employees