Sr. Manager - Site Reliability Engineer

VisaAshburn, VA
66d$152,200 - $220,850

About The Position

Visa’s Technology Organization is a dynamic community of problem solvers and innovators dedicated to redefining the future of commerce. We manage one of the world’s most advanced processing networks, handling over 65,000 secure transactions per second across 80 million merchants, 15,000 financial institutions, and billions of individuals. By joining us, you will engage with complex distributed systems and address large-scale challenges in new payment flows, business and data solutions, cybersecurity, and B2C platforms. As a Senior Manager, you should bring a well-rounded skill set that includes expertise in Site Reliability Engineering (SRE) principles and practices, cloud platforms (AWS, Azure, Google Cloud), and security protocols. You should have experience with cloud migration, automation tools, and applying Generative AI to improve operational efficiencies. Proficiency in containerization technologies (Docker, Kubernetes), observability tools, and distributed caching systems is essential. As a Technology Manager in Visa’s Operations and Infrastructure division, you will join our Global PRE team to design, enhance, and build a highly available, secure, scalable, and resilient infrastructure in an agile environment. You will collaborate with supportive and challenging colleagues daily. You will take lead roles on projects to ensure the reliability and performance of our services, platforms, RESTful APIs, container-based distributed systems, and cloud services.

Requirements

  • 8 or more years of relevant work experience with a Bachelor Degree or at least 5 years of experience with an Advanced Degree (e.g. Masters, MBA, JD, MD) or 2 years of work experience with a PhD.
  • Strong work ethic, self-starter, ability to work in a fast-paced, team-oriented environment, and comfortable working with a global team.
  • 10+ years of relevant work experience and a bachelor’s degree in computer science.
  • 8+ years of experience with JAVA, J2EE applications, and a deep understanding of Web Services technologies: REST & SOAP.
  • 4+ years of experience managing applications on Containers (Docker) and Cloud (AWS, GCP, Azure).
  • Good understanding of Linux, Jenkins, .NET or Java-based web applications, MySQL/MSSQL/Oracle database concepts, Tomcat services, and web applications on Apache.
  • Ability to build deployment scripts and automated solutions using scripting languages such as Shell scripting (Bash), JavaScript, Python, or others.
  • Good understanding of infrastructure components like Linux operating systems, virtual machines, MQ, storage, etc.
  • Knowledge of Generative AI capabilities and use cases to enable such capabilities in the environment.
  • Prior experience working in 24x7 environments.
  • Exceptional analytical and problem-solving skills, along with strong oral and written communication abilities.
  • Proven proficiency in troubleshooting, root-cause analysis, application design, and implementing major components for large projects.
  • Experience building tools to automate production support activities, enhancing the efficiency and productivity of service desk and operations groups.
  • Good understanding and knowledge of observability tools.

Responsibilities

  • Ensure the security and safety of application services and platforms.
  • Maintain zero downtime by swiftly addressing any issues to ensure environments are always operational.
  • Oversee all activities within the environment, including deploying new code.
  • Foster an inclusive, innovative, and collaborative team culture.
  • Build strong partnerships with key stakeholders, including product management, engineering, design, and operations.
  • Communicate effectively with both technical and business partners to create frameworks for discussing complex topics.
  • Regularly analyze the environment and promote the adoption of automation and Generative AI to stay competitive.
  • Lead cloud infrastructure adoption and migration, ensuring a seamless transition with minimal downtime.
  • Run problem bridges by collaborating with different functional and technical teams, escalating issues as needed for timely resolution.
  • Proactively share important context and information with relevant stakeholders.

Benefits

  • Medical
  • Dental
  • Vision
  • 401 (k)
  • FSA/HSA
  • Life Insurance
  • Paid Time Off
  • Wellness Program
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service