About The Position

In the Technology division, we leverage innovation to build the connections and capabilities that power our Firm, enabling our clients and colleagues to redefine markets and shape the future of our communities. This is a Lead Software Engineering position at Vice President level, which is part of the job family responsible for developing and maintaining software solutions that support business needs. Since 1935, Morgan Stanley is known as a global leader in financial services, always evolving and innovating to better serve our clients and our communities in more than 40 countries around the world. Role Summary We are looking for an experienced Technology Manager to lead a Site Reliability Engineering (SRE) team focused on reliability, DevOps, platform modernization, and AI enablement for a centralized data platform supporting Balance Sheet products. You will manage engineering delivery, collaborate with product/data/operations, and drive improvements in platform stability, observability, automation, and deployment, while accelerating modernization and AI operations. Candidates must have hands-on technical expertise, proven leadership, and knowledge of Balance Sheet/Finance products (liquidity, capital, collateral, secured financing, lending, margin/prime brokerage). You’ll develop and execute roadmap using modern SRE practices, DevOps automation, and AI-driven solutions.

Requirements

  • 7–15+ years in software or production engineering, including 3+ years in engineering management or technical leadership.
  • Expertise in SRE/Production Engineering: observability, incident management, reliability engineering, SLOs/SLIs.
  • Track record in DevOps transformation, automation, CI/CD, and engineering best practices.
  • Experience with platform modernization (cloud migration, performance, re-architecture, reducing tech debt).
  • Demonstrated delivery leadership in agile environments.
  • Domain expertise in Balance Sheet products and finance data flows; ability to work with finance stakeholders and regulatory needs.
  • Experience with centralized data platforms (data warehouse/lakehouse, batch/streaming, lineage systems).
  • Familiarity with regression frameworks for data pipelines in SDLC.
  • Experience with AI in engineering ops (AIOps/GenAI), responsible implementation, and governance.
  • Knowledge of cloud data platforms, API access patterns, and strong security practices.

Responsibilities

  • Lead a cross-functional SRE team to enhance platform stability, resilience, and operational standards.
  • Define and measure SLOs/SLIs, manage incident processes, and improve availability and recovery (RTO/RPO).
  • Promote a metrics-based engineering culture (deployment frequency, MTTR, pipeline health) and drive prioritized outcomes.
  • Advance CI/CD maturity, automation, environment hygiene, and shift-left controls for proactive risk management.
  • Modernize platform components and optimize data pipelines for performance and scalability; enable API/cloud-first access and real-time data.
  • Expand automated testing, integrate observability into workflows, and improve documentation for transparency and audit readiness.
  • Drive AI adoption in platform operations (lineage tracing, AI-assisted incident triage, anomaly detection, “talk-to-your-data” features); partner on high-impact AI cases.
  • Oversee hiring, coaching, agile delivery, and align priorities with stakeholders.
  • Translate business goals into actionable roadmaps with clear KPIs and milestones.

Benefits

  • Comprehensive employee benefits and perks in the industry.
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service