About The Position

Adobe’s RealTime Customer Data Platform (RTCDP) powers personalized customer experiences for some of the world’s largest brands. As a Senior SRE Technical Lead / Architect , you will play a critical role in ensuring RTCDP’s reliability, scalability, and operational excellence at a global scale. This is a high ownership , high impact role that bridges production operations (Day2 ownership) and core datastore engineering . You’ll shape how RTCDP is built, operated , and scaled—while directly owning production reliability for one of Adobe’s most customer- visible platforms.

Requirements

  • Senior SRE, infrastructure, or platform engineer with 10+ yrs of experience running large scale distributed systems
  • Strong background in datastores, reliability engineering, and automation
  • Proven experience leading production incidents and driving operational improvements
  • Comfortable operating with high ownership, ambiguity, and scale
  • Passion for building resilient systems and mentoring others

Responsibilities

  • Own Production Reliability
  • Own day 2 production reliability for RTCDP, ensuring availability, performance, and durability aligned with SLOs.
  • Serve as a technical lead during incidents , driving mitigation, recovery, and post incident analysis for SEV3 through SEV1 (CSO) events.
  • Improve incident response processes, on - call health, and operational readiness.
  • Partner with product and platform teams to ensure production ready launches and regional expansions .
  • Lead Core Datastore Strategy
  • Provide technical leadership for RTCDP’s core datastores , including: Aerospike FoundationDB Postgres CosmosDB DynamoDB
  • Drive reliability, scalability, upgrade, backup/restore, and disaster recovery strategies.
  • Lead or influence datastore automation (provisioning, scaling, upgrades, benchmarking, lifecycle management).
  • Contribute to cost optimization and efficiency improvements through rightsizing and architectural enhancements.
  • Architect for Scale & Automation
  • Design and build automation first solutions to reduce toil and improve system safety.
  • Influence architecture decisions that improve reliability, scalability, and operability .
  • Build and evolve monitoring, alerting, and observability focused on real customer impact.
  • Promote consistent operational patterns across services, teams, and regions.
  • Technical Leadership & Influence
  • Act as a senior technical authority within the SRE organization.
  • Mentor engineers and influence teams across geographies without relying on direct ownership.
  • Partner closely with engineering, infrastructure, and security teams.
  • Champion strong SRE and DevOps principles : ownership, automation, error budgets, and continuous improvement.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Education Level

No Education Listed

Number of Employees

5,001-10,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service