Site Reliability Engineer - Dynatrace - 6158762

AccentureKirkland, WA
1d$63 - $73Onsite

About The Position

Accenture Flex offers you the flexibility of local fixed-duration project-based work powered by Accenture, a leading global professional services company. Accenture is consistently recognized on FORTUNE's 100 Best Companies to Work For and Diversity Inc's Top 50 Companies For Diversity lists. As an Accenture Flex employee, you will apply your skills and experience to help drive business transformation for leading organizations and communities. In addition to delivering innovative solutions for Accenture's clients, you will work with a highly skilled, diverse network of people across Accenture businesses who are using the latest emerging technologies to address today's biggest business challenges. You will receive competitive rewards and access to benefits programs and world-class learning resources. Accenture Flex employees work in their local metro area onsite at the project, significantly reducing and/or eliminating the demands to travel. Job Description: We are seeking a Site Reliability Engineer with expertise in Dynatrace for our client. The SRE will be responsible for platform monitoring, reliability, and operational resilience across the Cloud Core ecosystem, including core banking platforms, integration layers, and downstream services. This role focuses on ensuring availability, performance, observability, and incident response readiness in a highly regulated financial services environment. This role requires Hands-on expertise in Dynatrace, including end-to-end configuration, dashboard creation, alert tuning, and root cause analysis.

Requirements

  • Minimum of 3 years of experience with Dynatrace
  • Minimum of 3 years of hands-on experience with cloud-native platforms (Azure preferred; AWS/GCP acceptable)
  • Hands-on expertise in Dynatrace, including end-to-end configuration, dashboard creation, alert tuning, and root cause analysis.

Nice To Haves

  • Strong understanding of distributed systems, microservices, and event-driven architectures.
  • Experience supporting mission-critical platforms with high availability requirements.
  • Proven experience with monitoring and observability tools such as Dynatrace, Azure Monitor, App Insights, Prometheus, Grafana, Splunk, Datadog, or equivalent.
  • Experience designing actionable alerts and reducing alert fatigue.
  • Strong understanding of logs, metrics, and traces and how to correlate them during incidents.

Responsibilities

  • Hands-on work in Dynatrace, including end-to-end configuration, dashboard creation, alert tuning, and root cause analysis.
  • Design, implement, and operate end-to-end monitoring and observability for Cloud Core platforms, including core banking, integrations, and supporting services.
  • Define and manage SLIs, SLOs, and error budgets aligned to business-critical banking services.
  • Monitor platform health across availability, latency, throughput, and error rates, proactively identifying reliability risks.
  • Lead and support incident management, including triage, root cause analysis (RCA), and post-incident reviews.
  • Partner with application, integration, data, and infrastructure teams to embed reliability into system design and delivery.
  • Automate operational tasks and monitoring workflows to reduce manual intervention and mean time to recovery (MTTR).
  • Support release readiness and change management, ensuring observability and rollback considerations are in place before production deployments.
  • Establish dashboards and reporting for operational visibility across technology and business stakeholders.

Benefits

  • Accenture offers a market competitive suite of benefits including medical, dental, vision, life, and long-term disability coverage, a 401(k) plan, bonus opportunities, paid holidays, and paid time off.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service