LaunchCode-posted about 1 month ago
$26 - $48/Yr
Full-time • Mid Level
Remote • Saint Louis, MO
51-100 employees
Professional, Scientific, and Technical Services

This role requires expertise in Dynatrace, AWS (including containerized platforms such as ECS and EKS), ServiceNow, Terraform, and Ansible. They will partner with FNBO's service and application owners to review architecture diagrams, assess operational workflows, and lead knowledge transfer sessions to ensure long-term sustainment.

  • Monitoring & Alerting Audit
  • Assess current monitoring and alerting configurations.
  • Identify and report on coverage gaps, alert fatigue issues, and improvement areas.
  • Monitoring Architecture Design
  • Develop and document for monitoring and alerting architecture standards.
  • Map applications to services, infrastructure components, and dependent platforms.
  • Dynatrace Configuration and Instrumentation
  • Implement end-to-end Dynatrace monitoring for the top 10 services, including:
  • Synthetic transactions
  • Real User Monitoring (RUM)
  • Grail queries
  • Site Reliability Guardian and workflows
  • Instrument Java applications and AWS-hosted workloads (ECS, EKS).
  • Alert Framework and Automation
  • Configure alerts with defined thresholds and business impact criteria.
  • Integrate alerts into ServiceNow for ticket creation and escalation.
  • Implement automated remediation for repeatable failures using Terraform and Ansible. 5.
  • Business Impact Dashboards
  • Build dashboards that provide visibility into business impacts when system are degrade or during an outage.
  • Tailor views for business, operations, and technical stakeholders.
  • Automated Healing Capabilities
  • Develop and deploy recovery automation:
  • Example: Restarting failed services, Re-provisioning or scaling containers
  • Demonstrate automation in at least three production scenarios.
  • Documentation and Knowledge Transfer
  • Provide clear documentation for all configurations, dashboards, and automation.
  • Lead working sessions to train FNBO teams on monitoring, dashboarding, and automated recovery.
  • Minimum 3+ years' experience with Dynatrace SaaS (required) implementation and.
  • Dynatrace Skills: Grail - queries, Workflows, RUM, Site Reliability Guardian, SLO, business events, other Gen 3 UI features (not just classic)
  • Strong proficiency in AWS infrastructure monitoring and CloudWatch
  • Advanced Java development skills with focus on application performance monitoring
  • Experience designing and implementing enterprise monitoring dashboards
  • Background in financial services technology preferred
  • Omaha, NE (preferred). At least a 4 hour overlap with CST.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service