Senior Site Reliability Engineer

Adobe•San Jose, CA

About The Position

We are seeking a Senior SRE (Site Reliability Engineer) to help compose, build, and operate highly scalable, secure, and resilient cloud platforms. We are redefining this role to focus on product and platform engineering. This position is a core builder role within the Developer Platform organization. It is responsible for crafting and evolving cloud platforms as internal products with clear customers, roadmaps, and outcomes. The ideal candidate thinks like a product engineer first: crafting scalable, secure, and reliable platforms that empower thousands of engineers to move faster with less operational burden. Reliability, security, and operability are built into the platform, not bolted on. This role is critical to preparing the organization for the next generation of cloud-native and AI-enabled platforms.

Requirements

Bachelor’s degree in Computer Science, Engineering, or equivalent practical experience.
10+ years of experience in platform engineering, cloud infrastructure, SRE, or large-scale distributed systems.
Strong experience crafting and operating AWS-based platforms.
Deep understanding of Kubernetes, container platforms, and cloud-native architectures.
Proven ability to build systems with reliability, security, and operability as core development constraints.
Strong programming skills (Python, Go, Java, or similar).
Practical experience with Infrastructure as Code (Terraform or equivalent).
Experience building shared platforms or internal products consumed by multiple teams.
Ability to think in terms of systems, tradeoffs, and long-term platform evolution rather than short-term fixes.

Responsibilities

Own platform capabilities end-to-end: architecture, APIs, operability, lifecycle management, and developer experience.
Translate reliability, scalability, and security requirements into reusable platform primitives and services.
Drive platform adoption by making the "right path" the easiest path for product teams.
Define reliability, availability, and performance targets as outstanding product requirements.
Build standardized reliability patterns (golden paths, reference architectures, guardrails) that product teams inherit by default.
Build for failure through multi-region architectures, graceful degradation, and automated recovery.
Build self-service workflows, tooling, and APIs that abstract infrastructure complexity away from application teams.
Apply Infrastructure as Code and policy-as-code to guarantee consistency, safety, and scalability.
Integrate CI/CD, provisioning, and operational workflows into a cohesive developer experience.
Embed secure-by-design principles into infrastructure architecture and delivery.
Implement secure networking, encryption, secret management, and access controls.
Partner with security teams to maintain compliance with enterprise and regulatory standards.
Proactively reduce risk through architecture reviews and security hardening.
Contribute to platform capabilities that leverage AI for operational intelligence and automation.
Develop scalable foundations that support the gradual addition of diagnostics, remediation, and insights based on intelligent systems into the platform.
Work closely with product engineering teams to understand difficulties and evolve platform capabilities accordingly.
Act as a technical leader and advisor on cloud-native build, reliability, and platform usage.
Produce high-quality build documents and reference implementations that scale knowledge across the organization.