Director, Soft Production Management & Reliability Engineering

Morgan Stanley
Alpharetta, GA
Posted 2d ago
$152,500
Hybrid

About The Position

Morgan Stanley Services Group, Inc. is seeking a Director, Soft Production Mgmt & Reliability Engineering in Alpharetta, GA to design, build, and maintain software applications and systems, and to keep them reliable once they are live. Telecommuting is permitted up to 2 days per week.

Requirements

  • Requires a Bachelor’s in Information Technology, Computer Science, or related field of study.
  • Requires five (5) years of experience in the position offered or five (5) years as an IT Analyst; Developer; Test Analyst; System Engineer; or related occupation in the technology field.
  • Requires five (5) years of experience with the following skills: .NET; C++; Java; Web Services (REST and SOAP); RDBMS (relational database management system) in databases including Microsoft SQL Server or DB2; writing SQL queries; designing, developing, and implementing technical solutions; debugging applications, database troubleshooting, and issue resolution; and using databases and logs end-to-end.
  • Requires two (2) years of experience with the following skills: Linux/Unix; a scripting language (Perl, Python, or Shell); and monitoring tools including Splunk.
  • Requires any amount of experience with the following skills: cloud-based deployment, security, and networking concepts in AWS; and automating deployments using Jenkins, Train, or Windeploy.

Responsibilities

  • Design, build, and maintain software applications and systems.
  • Monitor applications to prevent and resolve issues.
  • Troubleshoot both non-production and production issues across the entire stack: hardware, software, application and network.
  • Identify and drive opportunities to improve automation for the company; scope and create automation for deployment, management, and visibility of our services.
  • Perform root cause analysis for outages and incidents.
  • Gather telemetry and statistics to locate areas of the plant that can be improved.
  • Maintain software applications once they are live by measuring and monitoring availability, latency, and overall system health.
  • Update documentation, schedule jobs, and conduct triage calls for disaster recovery tests.