About The Position

At Allstate, great things happen when our people work together to protect families and their belongings from life’s uncertainties. And for more than 90 years, our innovative drive has kept us a step ahead of our customers’ evolving needs. From advocating for seat belts, air bags and graduated driving laws, to being an industry leader in pricing sophistication, telematics, and, more recently, device and identity protection. Job Description The Sr. Engineering Manager is the direct leader of more critical groups with significant responsibilities. These groups may sometimes be more heavily resourced with more senior staff or more staff in general or be responsible for more technically complex or strategically important parts of the architecture or product. Sr. Engineering Managers are expected to build relationships throughout the organization to ensure whole-company alignment and support for their goals, and vice-versa. The Sr. Engineering Manager architects and designs their digital products using modern tools, technologies, frameworks, and systems. They apply a systematic application of scientific and technological knowledge, methods, and experience to the design, implementation, testing, and documentation of software. They own and manage running their application in production, and ultimately becomes accountable for the success of their digital products through achieving KPIs. Role Partner with product teams to improve service reliability and user experience. Ensure adequate coverage for incident response through effective on-call management. Respond to incidents, conduct thorough post-mortems, and implement preventive measures to avoid future issues. Drive automation to reduce toil and improve delivery quality. Mentor engineers on technology and effective SRE practices. Create strategic plans to grow Site Reliability practice. Provide coaching and mentorship for a global team.

Requirements

  • Disciplined approach to maintaining and enforcing engineering best practices.
  • Proven experience proactively improving availability while reducing MTTR.
  • Significant experience providing front-line support for critical issues.
  • Experience supporting enterprise platforms such as Apigee, Kafka, and/or TIBCO.
  • Experience using observability tools such as DataDog to create dashboards and find the source of issues.
  • Technical background in distributed computing, hybrid cloud, and infrastructure support.
  • Proactive approach to identifying problems, performance bottlenecks, and areas for improvement.
  • Able to effectively communicate with all levels of the organization.
  • Detail oriented, self-organized, and capable of simultaneously tracking multiple issues of varying complexity.
  • 10 or more years of experience

Responsibilities

  • Partner with product teams to improve service reliability and user experience.
  • Ensure adequate coverage for incident response through effective on-call management.
  • Respond to incidents, conduct thorough post-mortems, and implement preventive measures to avoid future issues.
  • Drive automation to reduce toil and improve delivery quality.
  • Mentor engineers on technology and effective SRE practices.
  • Create strategic plans to grow Site Reliability practice.
  • Provide coaching and mentorship for a global team.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service