We’re determined to make a difference and are proud to be an insurance company that goes well beyond coverages and policies. Working here means having every opportunity to achieve your goals – and to help others accomplish theirs, too. Join our team as we help shape the future.
Hartford Fire Insurance Company in Charlotte, NC has the following opening for a Manager, Reliability Engineer.
Summary of Duties: Manage a team of Reliability Engineers through hiring, performance management, coaching and development. May include managing vendor partner relationships to create value for the organization. Guide the use of best-in-class software engineering standards, tools, and design practices to enable highly available and performant customer-facing applications. Lead adoption of metrics of overall application health - availability, performance, monitoring, alerting, quality, currency and resiliency. Serve as key liaison between the architecture and software engineering teams to influence the technical strategy for the organization, keeping in mind its cross-functional impacts, integration across the organization, and architecture rationalization. Function as the go-to technical expert for the applications and infrastructure supported, requiring depth and breadth of knowledge in technologies, applications, integration, interfaces and business domain. Develop effective tooling, alerts, and response mechanisms to identify and address reliability and security risks leveraging automation to support problem prevention, detection, mitigation, and resolution. Enhance the velocity of the SDLC by engineering the appropriate solutions to increase delivery speed while adhering to technology standards for sustained reliability. Progressively implement preventative controls and drive increased automation and self-healing capabilities. Continue to improve cost efficiency baselines. Promote and implement innovative solutions.
Champion the migration of applications to open source platforms, PaaS, containers, serverless, event-based designs, and other cloud technology standards for cloud-enablement and platform agility. Drive simplification across the stack, ensuring that all technical designs can be operated effectively and cost-efficiently without adding operational complexity. Drive inner- and open-sourcing practices to accelerate the development of self-service enterprise capabilities. Strong experience in setting up scalable SDLC environments using COTS, PaaS, and SaaS products catering to Data, Application and Infrastructure-based pipeline needs. Ability to build solutions to promote migration of applications to open source platforms, PaaS and use of containers and other cloud technology standards for cloud-enablement and platform agility. Ensure operational excellence. Independently drive the triaging and service restoration of all high-impact incidents in order to minimize the mean time to service restoration and impact to the business. Demonstrate end-to-end ownership. Partner with infrastructure teams to design and implement intelligent automation and orchestration systems, enhanced monitoring/alerting capabilities and rapid service restoration processes. Take proactive measures to prevent high-impact incidents. Achieve and maintain the technical business continuity of Hartford and third-party assets that support customer-facing functions. Accountable for improving the IT application and infrastructure resiliency. Govern the overall D&A platform ecosystem, with a focus on processes and solutions for data masking (PII management) and data lifecycle management needs.
Job Type
Full-time
Career Level
Manager
Number of Employees
5,001-10,000 employees