Staff Site Reliability Engineer

VisaAshburn, VA
97d$124,700 - $180,650Hybrid

About The Position

As a Staff Site Reliability Engineering (SRE) team, you will be part of a cross-functional Operations & Infrastructure group responsible for the reliability, availability, performance, and optimization of Visa Spend Clarity for Enterprises (VSCE). You will support teams in running robust applications, lead incident resolution efforts, and drive operational excellence through automation, observability, and platform modernization. This role is critical to Visa's transformation as we scale our product to a broader range of issuers through cloud infrastructure and automation. You will work closely with engineering, operations, and product teams to ensure our systems are resilient, secure, and continuously improving.

Requirements

  • 5 or more years of relevant work experience with a Bachelor's Degree or at least 2 years of work experience with an Advanced degree (e.g. Masters, MBA, JD, MD) or 0 years of work experience with a PhD.
  • Experience with transactional systems (e.g., banking, finance, telecommunications).
  • Proficiency in Microsoft stack (Windows Server, IIS, MS SQL Server).
  • Familiarity with middleware technologies (e.g., MQ, Active Directory, Session State).
  • Advanced experience with AWS cloud services, including designing and troubleshooting scalable, resilient infrastructure.
  • Knowledge of certificate management and secure system design (basic to intermediate level).
  • Strong troubleshooting, performance tuning, and capacity planning skills.
  • Exposure to PCI and other audit/control frameworks.
  • Experience with enterprise monitoring and orchestration tools.
  • Ability to work across time zones and with geographically dispersed teams.
  • Excellent communication, collaboration, and stakeholder management skills.
  • Self-motivated, adaptable, and committed to continuous learning and growth.
  • Experience leading initiatives and influencing across teams.
  • Customer-oriented mindset for both internal and external clients.

Nice To Haves

  • 6 or more years of work experience with a Bachelor's Degree or 4 or more years of relevant experience with an Advanced Degree (e.g. Masters, MBA, JD, MD) or up to 3 years of relevant experience with a PhD.

Responsibilities

  • Operate and improve distributed systems and SaaS applications in production environments.
  • Lead and coordinate incident response efforts, ensuring timely resolution and root cause analysis.
  • Collaborate with engineering teams to enhance system reliability, uptime, and performance.
  • Automate operational tasks using scripting and orchestration tools (e.g., PowerShell).
  • Support and configure middleware, load balancers, and Web Application Firewalls.
  • Drive strategic initiatives such as cloud migration and platform modernization.
  • Apply AWS cloud expertise to solve infrastructure problems and scalability challenges.
  • Monitor and manage enterprise systems using observability and alerting tools.
  • Participate in a 24/7/365 On Call rotation, including shift and weekend support as needed.
  • Contribute to internal platform development with a product-led mindset.
  • Ensure secure and compliant software delivery in regulated environments.
  • Support geographically dispersed systems across multiple time zones.
  • Provide support and documentation for task handoffs and transitions.

Benefits

  • Medical
  • Dental
  • Vision
  • 401(k)
  • FSA/HSA
  • Life Insurance
  • Paid Time Off
  • Wellness Program

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Industry

Credit Intermediation and Related Activities

Education Level

Bachelor's degree

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service