Site Reliability Engineer III

JPMorgan Chase & Co.•New York, NY

About The Position

As a Site Reliability Engineering at JPMorgan Chase within the Enterprise technology, liquidity risk team, you are the non-functional requirement owner and champion for the applications in your remit. You are a key influencer in your team’s strategic planning, driving continual improvement in customer experience, resiliency, security, scalability, monitoring, instrumentation, and automation of the software in your area. You act in a blameless, data-driven manner and navigate difficult situations with composure and tact.

Requirements

Formal training or certification on software engineering concepts and 5+ years applied experience
Advanced SRE knowledge and a proven track record implementing SRE practices across application and platform teams (including avoiding common pitfalls)
Experience leading technologists to resolve complex, firmwide technology issues
Ability to influence team culture by championing innovation and change
Experience hiring, developing, and recognizing talent
Proficiency in at least one programming language, with preference for JavaScript, Go, or Python
Hands-on experience with CI/CD tools (e.g., Jenkins, GitLab, Terraform)
Experience with containers and orchestration (e.g., Docker, Kubernetes, ECS)
Troubleshooting experience with common networking technologies and issues
Strong fundamentals across modern architectures and observability, including GraphQL (schema design, federation/supergraph), event-driven systems (Kafka concepts like partitions/consumer groups, DLQs, replay), microservices patterns (API gateways/routers, CQRS/event sourcing), and end-to-end telemetry using OpenTelemetry (metrics/logs/traces)

Nice To Haves

Strong hands-on ability to code and troubleshoot, with solid data fluency

Responsibilities

Lead SRE adoption across teams, balancing feature delivery with efficiency and system stability
Partner with peers and senior stakeholders to align on reliability goals and make trade-offs that improve outcomes
Set and track reliability and stability metrics, and use data to drive measurable improvements
Build a continuous-improvement culture by collecting real-time feedback and turning it into customer-impacting changes
Coordinate with other teams to share solutions and prevent duplicated work
Run blameless, data-driven post-mortems and regular debriefs to turn incidents (and wins) into learning
Coach and develop entry- to mid-level engineers through hands-on guidance and feedback

Benefits

We offer a competitive total rewards package including base salary determined based on the role, experience, skill set and location. Those in eligible roles may receive commission-based pay and/or discretionary incentive compensation, paid in the form of cash and/or forfeitable equity, awarded in recognition of individual achievements and contributions. We also offer a range of benefits and programs to meet employee needs, based on eligibility. These benefits include comprehensive health care coverage, on-site health and wellness centers, a retirement savings plan, backup childcare, tuition reimbursement, mental health support, financial coaching and more. Additional details about total compensation and benefits will be provided during the hiring process.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume