IT Infrastructure Team Leader

Southern CompanyAtlanta, GA
7h

About The Position

The Team Lead will provide leadership and direction to a team of technical staff responsible for maintaining and supporting critical technology systems focused on monitoring and event management and response. This role involves overseeing day-to-day operations, managing projects, and ensuring compliance with company standards and regulations. The Team Lead will work closely with supervisors and other departments to enhance team effectiveness and drive innovation. The successful candidate should put Safety First and always demonstrate Our Values. This position is also responsible for product ownership and technical support of monitoring tools that are used by the 24x7x365 Infrastructure Operations Center to provide monitoring, root cause identification, and incident resolution for the Southern Company computing, network and transport infrastructure.

Requirements

  • B.S., B.A., or B.B.A degree in Computer Science, Mathematics, Engineering, Management Information Systems, Business, or another related field, is preferred.
  • Minimum of five years of experience with Infrastructure Services technologies.
  • Proven ability to provide leadership to a team, both formally and informally.
  • Experience with Information Technology risk assessments, internal controls, controls testing, and internal & external audits.
  • General knowledge of IT industry trends.
  • Strong analytical, problem-solving, and troubleshooting skills.
  • Excellent communication and organizational skills.
  • Ability to handle multiple tasks with competing priorities.
  • Ability to bring consensus and buy-in among people with different views and agendas.
  • Ability to grasp complex operational issues, risk factors, and business drivers.
  • Strong customer service and negotiating skills.
  • Ability to work with business partners and peers of varying levels of technical proficiency.
  • Experience in event management, event correlation, discovery, topologies, event driven automation, and root cause technology in a large enterprise environment with many different monitoring source systems.
  • Experience administering and supporting an enterprise monitoring platform.
  • Strong analytical and troubleshooting skills.
  • Technical experience in SNMP, Microservices, PHP programming, Elastic, OpenSearch, HTML, CSS, SQL (MySQL, MS SQL, Oracle), containers, scripting (Perl or Python), Linux systems and shell scripting, JSON.
  • Development experience in scripting or compiled languages.
  • 24/7 On Call support.

Responsibilities

  • Provide leadership and work direction, delagate tasks, assist with prioritization, provide estimates, and set deadlines in support of department processes and work requests.
  • Lead (initiate, drive, oversee) production support, break/fix, preventive maintenance, patching, request fulfillment, projects, product implementations, outage restoration, compliance, lifecycle replacement.
  • Coordinate projects and monitor progress to ensure timely completion.
  • Encourage creativity, innovation, risk-taking, and collaborative thinking to create value.
  • Develop work practices, processes, procedures, and documentation.
  • Lead, motivate, and develop employees.
  • Exhibit initiative and follow through with customer commitments.
  • Identify coaching opportunities and training needs to support employee development and recognize high performance.
  • Monitor team performance and report on metrics to support continuous improvement and the success of the organization.
  • Create an inspiring team environment to drive innovation and build a trusting culture.
  • Gather team feedback, support diversity of thought, and promote healthy conflict resolution.
  • Assist with the O&M and Capital budgets for the function, understand cost and variance details.
  • Understand and support the technical direction in Service Monitoring and Operations Management modernization.
  • Provide infrastructure product maintenance, license support, configuration customizations, product integrations, process and procedure automation and on-going support to maximize the functionality so as to provide reliable and sustainable monitoring of the compute, network, and transport infrastructure.
  • Provide support for new and modified monitoring rules and code and perform data investigation in support of day to day operations.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service