Morgan Stanley-posted 3 months ago
Full-time • Senior
Alpharetta, GA
5,001-10,000 employees

In the Technology division, we leverage innovation to build the connections and capabilities that power our Firm, enabling our clients and colleagues to redefine markets and shape the future of our communities. This is a Software Production Management & Reliability Engineering position at Vice President level, which is part of the job family responsible for overseeing the production environment, ensuring the operational reliability of deployed software, and implementing strategies to optimize performance and minimize downtime. Morgan Stanley is an industry leader in financial services, known for mobilizing capital to help governments, corporations, institutions, and individuals around the world achieve their financial goals. Interested in joining a team that’s eager to create, innovate and make an impact on the world? Read on. The successful candidate will use their technical experience to own and drive internal firmwide applications integrations including DevOps enablement, deployment and integration of services in the cloud and on-premise offerings such as GitHub, Kubernetes and a various SaaS Cloud platform integrations.

  • Design, engineering, integration, and enhancements of Agile and DevOps enablement tools and applications by utilizing DevOps principles
  • Design and manage AI infrastructure: Build and maintain scalable, reliable, and secure cloud infrastructure (like on AWS, Azure, or GCP) that is optimized for resource-intensive AI workloads, often involving GPUs.
  • Develop MLOps pipelines: Create and manage automated Continuous Integration/Continuous Deployment (CI/CD) pipelines specifically for machine learning models.
  • Embed security practices and compliance standards (DevSecOps) directly into the platform and pipelines.
  • Follow SDLC process and practices (Functional Specifications and Testing, Design Specifications, Code Reviews, Unit Testing, Monitoring)
  • Administer Jenkins, Azure DevOps and GitHub to automate build and deployment processes
  • Manage and configure jFrog Artifactory and NexusIQ for effective package repository maintenance
  • Administer SonarQube for continuous code quality assessments
  • Oversee Docker, Rancher, and OpenShift to streamline container orchestration deployment
  • Experience with terraform, and Azure for Infrastructure provisioning and scaling
  • Administer, manage and configure Jira and Confluence to foster team collaboration and productivity
  • Implement, manage and fine-tune monitoring and alerting systems to ensure robust system performance and swift incident response
  • Utilize Python for scripting and automation tasks related to tool administration
  • Work closely with security teams to ensure tool compliance with organizational security policies
  • Configuring, building, and deploying applications into cloud (DevOps Pipelines, GitHub Actions)
  • Reduce Toil, increase automation, evaluate new technologies (Open AI) and explore their applicability to address new requirements
  • Bachelor's degree in Computer Science, Engineering, or a related field
  • 10+ years of experience in DevOps role
  • Experience in jFrog Artifactory, NexusIQ, SonarQube, Jenkins, Docker, Terraform, GitHub, Python, Jira, Confluence, Rancher, AWS, Azure services, OpenShift
  • Solid understanding of source control systems (Git, Subversion)
  • Strong understanding of monitoring and alerting best practices
  • Excellent analytical, troubleshooting, and communication skills
  • Experience working in Agile Environment
  • Experience in Finance Industry, Wealth domain is a plus
  • Comprehensive employee benefits and perks
  • Opportunities for career advancement within the company
  • Support for employees and their families at every point along their work-life journey
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service