Senior Applications Support Specialist

Ensono
$85,000 - $111,000Hybrid

About The Position

The L3 ARO Engineer ensures end‑to-end reliability, resilience, and performance of critical applications. This role acts as the final technical escalation point, leads major incidents, performs deep diagnostics (especially for Java-based systems), drives permanent fixes, and influences architecture, automation, and operational standards. The engineer mentors L1/L2 teams and partners closely with Development, Architecture, Platform, and Security.

Requirements

  • Strong knowledge of application architecture, distributed systems, and middleware.
  • Java expertise: JVM internals, GC, memory management, thread/heap dump analysis, performance tuning.
  • Strong Unix/Linux, networking basics, and advanced scripting (Shell/Python/PowerShell/VBS).
  • Advanced SQL and understanding of databases; Autosys (or equivalent scheduler).
  • Hands‑on with observability tools: Splunk, AppDynamics/Dynatrace, ELK, Grafana, Prometheus.
  • Major incident leadership, deep RCA, change/release readiness, DR & resilience engineering.
  • Experience in regulated production environments.
  • Strong technical leadership and decision‑making.
  • Clear communication during high‑pressure incidents.
  • Ownership mindset and business awareness.
  • 7–12+ years in Application Reliability, Production Support, SRE, or platform operations.
  • Bachelor’s degree in Computer Science/Engineering or equivalent.

Nice To Haves

  • ITIL, cloud, or industry certifications (preferred).
  • Banking/financial domain experience (preferred).

Responsibilities

  • Lead major incident (MI) bridges and restore service with minimum business impact.
  • Handle all L3 escalations, perform deep diagnostics across Java, JVM, middleware, OS, and infra.
  • Own technical RCAs, drive long‑term and systemic remediation.
  • Identify recurring failure patterns and risks.
  • Apply SRE principles: SLIs/SLOs, error budgets, resilience patterns.
  • Tune JVM parameters, analyze thread/heap dumps, and improve performance.
  • Influence application architecture for fault tolerance, scalability, and recoverability.
  • Validate DR readiness, failover behavior, and resilience testing outcomes.
  • Provide technical approval and risk assessment for high-risk changes.
  • Enforce operational readiness for new apps and major releases.
  • Ensure changes meet audit, compliance, and regulatory expectations.
  • Build advanced automation using Shell/Python/PowerShell.
  • Develop frameworks for health validation, automated recovery, and compliance checks.
  • Define observability standards; optimize alerts and improve MTTR.
  • Mentor L1/L2 teams; review and approve runbooks, SOPs, and KB articles.
  • Act as a trusted technical advisor to stakeholders and leadership.

Benefits

  • Unlimited Paid Days Off
  • Three health plan options
  • 401k with company match
  • Eligibility for dental, vision, short and long-term disability, life and AD&D coverage, and flexible spending accounts
  • Family Forming Benefit including fertility coverage and adoption/surrogacy reimbursement
  • Paid childbearing and paternal leave
  • Education Reimbursement, Student Loan Assistance or 529 College Funding
  • Sabbatical leave
  • Wellness program
  • Flexible work schedule

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Number of Employees

501-1,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service