Lead Site Reliability Engineer

JPMorgan Chase & Co.
New York, NY

About The Position

As a Lead Site Reliability Engineer at JPMorgan Chase within the Enterprise Technology, Liquidity Risk team, you are the non-functional requirement owner and champion for the applications in your remit. You are a key influencer in your team’s strategic planning, driving continual improvement in customer experience, resiliency, security, scalability, monitoring, instrumentation, and automation of the software in your area. You act in a blameless, data-driven manner and navigate difficult situations with composure and tact.

Chase is a leading financial services firm, helping nearly half of America’s households and small businesses achieve their financial goals through a broad range of financial products. Our mission is to create engaged, lifelong relationships and put our customers at the heart of everything we do. We also help small businesses, nonprofits and cities grow, delivering solutions that meet all their financial needs.

Requirements

  • Formal training or certification in software engineering concepts plus 5+ years of applied experience
  • Advanced knowledge of SRE principles and a track record of implementing SRE across application and platform teams while avoiding common pitfalls
  • Experience leading technologists to manage and resolve complex technology issues at a firmwide level
  • Ability to influence team culture by championing innovation and driving change
  • Experience hiring, developing, and recognizing talent
  • Proficiency in at least one programming language (preferred: JavaScript, Go, Python)
  • Hands-on experience with CI/CD and infrastructure-as-code tools (e.g., Jenkins, GitLab, Terraform)
  • Experience with containers and orchestration (e.g., Docker, Kubernetes, ECS)
  • Strong troubleshooting skills across common networking technologies and issues
  • Working knowledge of modern service and integration patterns, including GraphQL fundamentals, event-driven architecture (Kafka or equivalent), and observability/telemetry with OpenTelemetry

Nice To Haves

  • Ability to code, troubleshoot, and demonstrate strong data fluency

Responsibilities

  • Lead SRE practices that balance delivery speed, efficiency, and system stability
  • Partner with engineering peers and senior stakeholders to drive strong, shared outcomes
  • Scale SRE adoption across application and platform teams
  • Set reliability expectations and show progress through stability and reliability metrics
  • Run blameless, data-driven post-incident reviews and regular debriefs to turn lessons into improvements
  • Build a continuous-improvement culture by gathering feedback and improving the customer experience
  • Coach entry- to mid-level engineers and promote knowledge sharing through internal forums and communities

Benefits

  • We offer a competitive total rewards package, including a base salary based on the role, experience, skill set and location. Those in eligible roles may receive commission-based pay and/or discretionary incentive compensation, paid in the form of cash and/or forfeitable equity, awarded in recognition of individual achievements and contributions. We also offer a range of benefits and programs to meet employee needs, based on eligibility. These benefits include comprehensive health care coverage, on-site health and wellness centers, a retirement savings plan, backup childcare, tuition reimbursement, mental health support, financial coaching and more. Additional details about total compensation and benefits will be provided during the hiring process.

What This Job Offers

  • Job Type: Full-time
  • Career Level: Mid Level
  • Education Level: No Education Listed
  • Number of Employees: 5,001-10,000
