Director - Java Developer – AI & Databricks Engineering

Morgan Stanley - Edison, NJ
14d - $95,000 - $135,000

About The Position

In the Technology division, we leverage innovation to build the connections and capabilities that power our Firm, enabling our clients and colleagues to redefine markets and shape the future of our communities. This is a Lead Infrastructure Production Management & Reliability Engineering position at the Director level, which is part of the job family responsible for maintaining the stability and reliability of the organization's infrastructure systems, ensuring optimal performance and availability to support business operations.

Morgan Stanley is an industry leader in financial services, known for mobilizing capital to help governments, corporations, institutions, and individuals around the world achieve their financial goals. Interested in joining a team that’s eager to create, innovate and make an impact on the world? Read on.

Role Profile: We are looking for an AI Java Developer to join our Platform Engineering team. This role will be critical in designing, building, and optimizing a scalable, secure, and developer-friendly Databricks platform to enable Machine Learning (ML) and Artificial Intelligence (AI) workloads at enterprise scale. You will partner with ML engineers, data scientists, platform teams, and cloud architects to automate infrastructure, enforce best practices, and streamline the end-to-end ML lifecycle using modern cloud-native technologies.

Requirements

  • 5+ years of experience in Engineering or a related field.
  • Proven experience with Terraform for building and managing infrastructure.
  • Strong programming skills in Java.
  • Proficiency with standard Linux command line and debugging tools.
  • Hands-on experience with cloud networking, identity and access management, key vaults, monitoring, and logging in Azure.
  • Hands-on experience with Databricks (Workspace management, Clusters, Jobs, MLflow, Delta Lake, Unity Catalog, Mosaic AI).
  • Deep understanding of Azure or AWS infrastructure (e.g. IAM, VNets/VPC, Storage, Networks, Compute, Key management, monitoring).
  • Strong experience in distributed system design, development, and deployment using Agile/DevOps practices.
  • Experience with CI/CD pipelines (GitHub Actions or similar).
  • Experience implementing monitoring and observability using Prometheus, Grafana or Databricks-native solutions.
  • Good communication skills, excellent teamwork experience, and the ability to mentor and develop more junior developers, including participating in constructive code reviews.

Nice To Haves

  • Experience with Databricks REST APIs and SDKs
  • Knowledge of MLflow, Mosaic AI, and MLOps tooling
  • Working with teams using Scrum, Kanban or other agile practices
  • Strong programming skills in Python

Responsibilities

  • Design and implement secure, scalable, and automated Databricks environments to support AI/ML workloads.
  • Develop infrastructure-as-code (IaC) solutions using Terraform for provisioning Databricks, cloud resources, and network configurations.
  • Build automation and self-service capabilities using Java and APIs for platform onboarding, workspace provisioning, orchestration and monitoring.
  • Collaborate with data science and ML teams to define compute requirements, governance policies, and efficient workflows across dev/qa/prod environments.
  • Integrate Databricks offerings with cloud-native services on Azure/AWS.
  • Champion CI/CD and GitOps for managing ML infrastructure and configurations.
  • Ensure compliance with enterprise security and data governance requirements using RBAC, audit controls, encryption, network isolation, and policies.
  • Monitor platform performance, reliability, and usage, and drive improvements to optimize cost and resource utilization.

What This Job Offers

  • Job Type: Full-time
  • Career Level: Mid Level
  • Education Level: No Education Listed
  • Number of Employees: 5,001-10,000 employees
