Lead Site Reliability Engineer

Centene Corporation•Northampton, MO

47d•Hybrid

About The Position

You could be the one who changes everything for our 28 million members by using technology to improve health outcomes around the world. Centene is a diversified, national organization, and its technology professionals have access to competitive benefits, including a fresh perspective on workplace flexibility.

Position Purpose: Uses advanced experience to lead more complex projects end to end that focus on managing and maintaining optimum platform infrastructure performance, reliability, and security using SRE practices, observability tools, manual and automated procedures, documentation, people and processes, and continuous integration and delivery (CI/CD) tools, processes, and designs.

Requirements

Bachelor's degree in a quantitative or business field (e.g., statistics, mathematics, engineering, computer science) and 5–7 years of related experience.
Or equivalent experience acquired through accomplishment of applicable knowledge, duties, scope, and skill reflective of the level of this position.
Experience with Linux, Unix, and Windows operating systems
Experience with observability/monitoring tools such as Splunk, Dynatrace, Elastic, New Relic, Prometheus, Grafana
Experience with enterprise-level CI/CD tools such as Ansible, Jenkins, CloudBees, OpenShift
Experience working in public cloud platforms like AWS and Azure
Experience with programming tools
Experience with building and operating highly scaled applications
Experience with MongoDB, MySQL, Oracle Database, PL/SQL, and SQL
Experience with code repositories, automated deployments, and branching using tools such as GitLab, Bitbucket, Subversion
Experience with IT service management tools such as ServiceNow, Atlassian, BMC
Seeks to acquire knowledge in area of specialty
Ability to identify basic problems and procedural irregularities, collect data, establish facts, and draw valid conclusions
Ability to work independently
Demonstrated analytical skills
Demonstrated project management skills
Demonstrates a high level of accuracy, even under pressure
Demonstrates excellent judgment and decision-making skills
Ability to communicate and make recommendations to upper management
Ability to drive multiple projects to successful completion
Possesses technical aptitude

Responsibilities

Leads more complex projects end to end that focus on managing and maintaining optimum platform infrastructure performance, reliability, and security using SRE practices, observability tools, manual and automated procedures, documentation, people and processes, and continuous integration and delivery (CI/CD) tools, processes, and designs.
Leads the development and delivery of complex services to automate monitoring activities and provide critical information to facilitate response and resolution of performance and availability issues and incidents.
Leads the delivery of standardized and scalable software tools to ensure that systems operate without interruption at optimum performance and leads project teams throughout the deployment process.
Troubleshoots and analyzes service disruptions to determine the root cause of issues and develop solutions for improved reliability.
Leads the team to identify problems with systems and services and drives regular deployment of new versions of the systems and their subcomponents.
Leads projects end to end that focus on building and maintaining observability/monitoring for the application, monitoring key performance indicators, maintaining alerting, and continuously improving visibility.
Drives decisions around periodic system validation and testing, service monitoring, and standing up new services/tools
Uses advanced knowledge and experience to identify strategies that increase system reliability and performance through on-call rotation and process optimization
Leads post-incident reviews and documents findings to inform future decision making
Drives implementation of approved proposals to optimize the Software Development Life Cycle (SDLC) to boost service reliability
Leads functional and development teams to investigate and document issues and leads the internal team to develop solutions to mitigate them
Leads root cause and problem-solving initiatives
Understands and adopts new technologies, tools, methods, and processes from Microsoft and the industry
Coaches and mentors team.
Designs and implements key performance indicators
Contributes to engineering and organization success by welcoming related, different, and new requests and helping others accomplish job results
Trains the engineering team on new systems, protocols, and best practices
Drives and coaches others through reviews of design, code, and test cases
Performs other duties as assigned
Complies with all policies and standards

Benefits

Competitive pay
Health insurance
401(k) and stock purchase plans
Tuition reimbursement
Paid time off plus holidays
A flexible approach to work with remote, hybrid, field, or office work schedules
