Manager, SRE Risk Advisory and Oversight

Capital OneMcLean, VA
Remote

About The Position

Technology & Data Risk Management (TDRM) is a small organization that packs a big punch. The ~200 professionals in TDRM are trusted experts who oversee ~14,000 developers at Capital One. We raise the bar for excellence in cybersecurity, reliability, tech risk, and data management risk. We shape strategy and decisions, challenge activities to ensure they meet our standards, and perform independent tests of our security and technology risk. For years, the cybersecurity community has debated whether the CISO should report to the CIO or not. In regulated financial services, the answer is: both. The first-line CISO has operational responsibilities and reports to the CIO. The second-line Chief Tech Risk Officer (CTRO) and the Tech & Data Risk Management (TDRM) organization have broader responsibilities for cybersecurity but also reliability, software quality, resilience, and the risk of failing to manage our data. The CTRO is independent and oversees the work of the CISO, the CIO/CTO, and the Chief Data Officer. The CTRO reports to the Chief Risk Officer, who reports directly to the CEO. Our business leaders must constantly make technology decisions. TDRM makes sure they have the tech and data risk information they need to make good decisions. Associates within TDRM are highly-skilled information security, cybersecurity, site reliability engineering, technology, data analyst, data scientist, and risk management professionals. They have a wealth of experience and a demonstrated ability to add value with their advice and to deliver high-impact results. As the Manager, SRE Risk Advisory and Oversight , you will serve as a technical expert providing second-line oversight and effective challenge over Capital One’s software engineering and Site Reliability Engineering (SRE) practices. This is an advisory-based role where you will leverage your foundational engineering background to conduct deep-dive technical risk analyses and assessments on cloud implementations, resilience architectures, and emerging automation capabilities (including Generative AI toolsets). You will perform the core technical analytical work and partner closely with Sr. Managers and Directors to synthesize risk findings, develop strategic mitigation recommendations, and build compelling, data-driven narrative materials for executive storytelling.

Requirements

  • Bachelor’s Degree or military experience
  • At least 4 years of experience in Technology Management, Software Engineering, Site Reliability Engineering, or Cyber Risk Management
  • At least 2 years of experience with cloud implementations (AWS, GCP, or Azure)
  • At least 1 year of experience with open-source programming languages

Nice To Haves

  • Master’s Degree in Computer Science, Computer Engineering, or a relevant technical discipline.
  • Professional cloud or infrastructure certification (AWS Certified Solutions Architect, AWS SysOps Administrator).
  • Experience analyzing or utilizing enterprise monitoring, observability, and alerting toolsets (Splunk, Prometheus, Datadog, ELK, PagerDuty).
  • Demonstrated understanding of cloud-native systems, containerization stacks (Kubernetes), and CI/CD pipelines.
  • Proven experience drafting technical assessments or presentation materials used to communicate technical findings to senior leadership.
  • Strong communication and interpersonal skills, with the ability to influence and drive technical alignment across stakeholder groups.
  • Prior experience working within financial services or another highly-regulated industry.

Responsibilities

  • Perform Deep-Dive Risk Analysis: Conduct independent, technical risk assessments of cloud infrastructure architectures, software delivery lifecycles, and observability frameworks to identify systemic resilience and stability risks.
  • Support Effective Challenge: Evaluate first-line cloud engineering practices against enterprise risk appetites, ensuring robust strategies are maintained for automation, system resiliency, performance, and monitoring.
  • Build Storytelling & Reporting Materials: Partner with team leadership (Sr. Managers and Directors) to translate complex, highly technical engineering data into structured risk reports, presentation decks, and executive storytelling materials.
  • SRE Subject Matter Expertise: Serve as a trusted technical analyst on core SRE pillars, assessing the design and maturity of Service Level Indicators/Objectives (SLIs/SLOs), error budgets, release pipelines (CI/CD), and toil reduction efforts.
  • Evaluate AI & Tech Integration: Actively evaluate the integration of cutting-edge technologies—specifically cloud-native stacks, containerization, and the application of emerging Gen AI/ML tooling within software delivery—to ensure reliable operational boundaries.
  • Formulate Risk Recommendations: Collaborate across the second line of defense to design, adjust, and recommend appropriate mitigating controls and guardrails for emerging cloud tech.
  • Stakeholder Partnership: Build and maintain collaborative relationships with first-line engineers, architects, and technical owners to ensure risk assessments are thoroughly understood and communicated transparently.

Benefits

  • comprehensive, competitive, and inclusive set of health, financial and other benefits that support your total well-being
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service