Analyst, Application Engineer

BlackRockWilmington, DE
$90,250 - $110,000Hybrid

About The Position

About the Role You can work with us at one of the top FinTech companies. We sell our Aladdin platform to over 200 of the world’s leading financial institutions, collectively managing approximately a quarter of global assets under management. BlackRock is a global but close‑knit organization united by a common goal: delivering the highest level of service to our business partners and customers. We value diversity of thought, background, and experience, and we invest seriously in our people through flexible time off, collaborative work environments, and strong career development opportunities. In this role, you will support business‑critical computing workloads, real‑time and batch processing, data transfer services, application onboarding and upgrades, and recovery procedures. You will work as part of a globally distributed team operating 24x7x365 to ensure the stability and reliability of production environments. This position provides hands‑on exposure to large‑scale production systems and modern operational practices, including automation, observability, and AI‑assisted operations. If this excites you, we’d like to talk. Team Overview The Service Management Operations Group monitors, supports, and administers production environments for all BlackRock businesses (including subsidiaries and BlackRock Solutions). The group acts as a first responder for incident detection, troubleshooting, resolution, and escalation. You will collaborate with experienced professionals across regions and technologies, gaining exposure to a broad range of platforms and applications while contributing to service quality, reliability, and continuous operational improvement as part of the One BlackRock culture.

Requirements

  • Bachelor’s degree in Computer Science, Engineering, Information Systems, or equivalent practical experience.
  • 0–3 years of experience in production support, service management, operations, DevOps, or related technical roles.
  • Basic familiarity with Linux/Unix systems, networking concepts, and distributed applications.
  • Exposure to monitoring/observability tools (metrics, logs, dashboards, alerts).
  • Understanding of incident management fundamentals and IT service management concepts.
  • Interest in automation, scripting, and modern operational practices (e.g., reliability, resiliency).
  • Strong analytical skills, attention to detail, and ability to follow structured processes.
  • Clear written and verbal communication skills; comfortable working in a global, follow‑the‑sun team.
  • Willingness to learn, adapt, and operate in a 24x7 production support environment.

Responsibilities

  • Support reliability and availability of production systems Monitor production environments and respond to alerts and incidents according to documented procedures. Assist in maintaining availability, performance, and recovery objectives for critical workloads. Participate in incident reviews and help document root causes and follow‑up actions.
  • Operate signal‑driven monitoring and alerting Use monitoring and observability tools to identify system health issues and potential risks. Help validate alerts and distinguish real production impact from noise. Escalate issues appropriately based on impact, urgency, and runbooks.
  • Execute automation and runbooks (automation‑aware) Execute scripted remediation and automation for known failure scenarios. Follow defined guardrails, approvals, and audit requirements when performing recovery actions. Identify recurring manual tasks or failure patterns and suggest candidates for automation.
  • Partner with engineering on operability Work with engineering teams during deployments, upgrades, and production readiness activities. Provide operational feedback on monitoring gaps, documentation quality, and supportability issues. Help ensure services are observable, recoverable, and supportable in production.
  • Change‑aware production support Support change implementation by monitoring post‑change system behavior and health. Assist with capacity, resilience, and disaster recovery activities such as testing and exercises. Follow disciplined change and incident management processes.
  • Documentation, evidence, and operational hygiene Maintain accurate incident records, handover notes, and operational documentation. Support audit and compliance requests by gathering logs, metrics, and operational evidence. Contribute to continuous improvement through post‑incident learnings and process updates.

Benefits

  • employees are eligible for an annual discretionary bonus, and benefits including healthcare, leave benefits, and retirement benefits
  • strong retirement plan
  • tuition reimbursement
  • comprehensive healthcare
  • support for working parents
  • Flexible Time Off (FTO)
  • BlackRock’s hybrid work model is designed to enable a culture of collaboration and apprenticeship that enriches the experience of our employees, while supporting flexibility for all.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service