About The Position

PepsiCo’s Digital Transformation requires new business processes, new digital products and new operations outcomes. This high velocity Digital Transformation necessitates Effective, Modern & Resilient operations in an SRE construct for all the programs under TS& EP Portfolio, per the main purpose to drive higher order outcomes to our customers who use our our tools / products to achieve their business process resiliency enabled via a full SRE Practice incident prevention / proactive resolution model in the 200+ application portfolio of Cloud native, Traditional, COTS & SAAS solutions. To bring this to life, we are looking for a self-driven, forward thinking Principal Solution Architect – SRE Application Design & Cross Product Resiliency Solutions, an advanced subject matter expert of application architectures both traditional on-prem and modern cloud native, Seasoned leader of SRE application engineering to Design & enable new shift left activities building a world-class AI-ready function, "SRE @Design", that embeds resiliency within the product, cross product interactions & Business needs ensuring PepsiCo’s digital ecosystem stays ahead of high-velocity market changes. Oversight into production, as continuous improvements, of the SRE Orchestration of anomalies & resiliency actions preventing P1/P2/P3 &Customer impact items across the end2end ecosystem achieving business / user KPIs.

Requirements

  • A Bachelor’s Degree in Computer Science, Engineering or a related field, preferably a Master’s or PhD in Computer Science or Engineering preferred
  • 16-20 years of experience working within a cross-functional Technology organization in partnership with Sector Leads, Global Software Engineering, Enterprise/Solution Architecture, Information Security.
  • Minimum of 10-12 years of software development, Solution Architect, engineering management practice delivering Digital Enablement (Digital Products, Strategy, Operations, etc) across a Global Technology Function especially towards a north star transformation
  • 8+ years of Hand-on experience in Java and Spring / SpringBoot ecosystem, JUnit, MicroServices, Serverless Computing, Rest APIs, Kubernetes, Kafka, Data Pipelines, data products, EBI
  • A seasoned application solution architect with 5+ years of experience as a Cloud & on-prem architecture in hyperscale Public Cloud, Azure, Amazon Web Services and cloud specific PaaS and SaaS solutions
  • Commanding knowledge of data structures, algorithms, and object-oriented design & Working knowledge of programming languages beyond Java, C or C++ (e.g. Ruby, Python, Perl)
  • A firm understanding of SRE (Software Reliability Engineering) and IT Service Management (ITSM) processes with a track record for improving service offerings - resolving incidents, providing a seamless customer/end-user experience and proactively identifying and mitigating areas of risk
  • Experience in designing Resilience & Failure Modelling - early-stage threat modelling & Analysis to build self-healing systems
  • Governance & blueprinting organizational technical standards and compliance early to avoid late-stage rework
  • Solution guidelines reducing change related incidents in production via Monitoring, Outage, Roll back, Business Effect, Integrations Impact, Smoke Test
  • The ideal leader will be highly quantitative, have great judgment, able to connect dots across workstreams, and efficiently work cross-functionally across teams to ensure SRE orchestrating solutions are meeting customer/end-user expectations
  • Good understanding of Business Process Management, Rule based processing, Enterprise application design, Business Domain Model definition & UML Patterns.
  • A true product mindset, Technical Influencing across Stakeholders with ability to build cross-functional relationships through trust, respect, and partnership
  • Core Technical exposure in Application solutions
  • AKS, Kubernetes, Terraform, Azure API Management, Azure Functions, backend SQL, No SQL, App Dynamics Elastic, Prometheus, Grafana, OpenTelemetry, Key Vault
  • Languages - C#, Java, ASP.Net, API, Restful, Micro service. ReactJS,
  • Frameworks : .net core JDK, spring boots, WCF, Web API,
  • Database: Postgres, MySQL, MemSQL, SQL Server, Oracle, Cloudera, NoSQLCloud Platform: Azure, AWS, Google
  • Testing Tools: JMeter, Selenium, QTP 9.0, Load Runner 6.0
  • Integration Servers: Biztalk, Axway, Boomi,
  • Workload Automation, e-Gate, Mirth

Nice To Haves

  • A fast and fearless leader, learner and team player that embraces a transformational mindset and company’s core values better, stronger and faster.
  • Ability to identify patterns, process information quickly, and make decisions in a timely manner – analytical mindset
  • Ability to create new solutions to an old problems, even when greeted with opposition – innovative mindset
  • Ability to greet each new day, project, escalation or setback as an opportunity to improve – positive mindset
  • Ability to stand up for what is right and most advantageous for the company, its employees and its shareholders – fearless mindset

Responsibilities

  • Reporting directly to the Sr. Director, Platform Solution Engineering - SRE Practice
  • Is accountable & responsible to drive the SRE Engineering activities within the application(s) design and architecting the SRE Cross product resiliency solutions that enables seamless pre-emptive & proactive detect, diagnose & recover minimizing the impact operating in an ever evolving landscape of product offerings to deliver first class service to our customers
  • Accountable to Engage, influence & collaborate across Engineering, Product, Architecture, Operations defining & achieving end2end output resiliency solutions via engineering standards, SRE principles and practices
  • Solution architect & drive across teams maturing towards Zero nett impact Day 1 operations of change, KPI & Transition through design
  • Be the Technical leader Instituting logging, Non-functional requirements, preventing faulty outcomes’ health checks are baked into the solution architecture through design via new programs or a uplift roadmap to reach maturity.
  • Leads the definition, collection and analysis of data relevant products systems and their interactions towards business process resiliency especially related impacting customer satisfaction, Revenue or IT productivity
  • A key player maturing as best of breed Instrumentation & anomaly standards / practices in the diverse nature of our solutions involving Digital cloud, Data pipelines, Traditional architecture as Transactional data assurance, Health & consumption measures including open telemetry
  • Responsible to architecting & developing SRE orchestration plug-in capabilities leveraging data & AI towards the business outcomes ensuring effective consumption & integration of diagnostics, anomaly detection, and resiliency actions to deliver automated and personalized experiences
  • Seasoned with hands on developing full stack solutions in Java and Spring / SpringBoot ecosystem, JUnit , BackEnd MicroServices, Serverless Computing and REST API’S, along with Postgress and SQL
  • An expert of application solution architectures ensuring we are designing SRE orchestration solution to scale across the cloud native and traditional on-prem application of 200 + apps across Business Processes, Applications, Services (ie. data pipelines) and Infrastructure
  • An integral part of being in front on critical issues driving timely resolution and clear communication with stakeholders.
  • Interact with key business partners to recommend solutions that best meet the strategic needs of the business
  • Influences Product, Engg, UI/UX tech debt prioritization balancing impact, cost & optimization
  • Evolves our organization as Industry leader in SRE & AI Ops capabilities partnering with both internal Enterprise architecture and external SRE Practitioners
  • Evangelism and education throughout S&T with all levels of leadership to ensure awareness and alignment Application Reliability & Assurance objectives and AI initiatives.
  • Operates in an ever-evolving landscape of product offerings to deliver first class service to our customers.

Benefits

  • Bonus based on performance and eligibility target payout is 15% of annual salary paid out annually.
  • Paid time off subject to eligibility, including paid parental leave, vacation, sick, and bereavement.
  • Medical, Dental, Vision, Disability, Health, and Dependent Care Reimbursement Accounts, Employee Assistance Program (EAP), Insurance (Accident, Group Legal, Life), Defined Contribution Retirement Plan.
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service