Senior Site Reliability Engineer - Splunk ITSI Specialist

The HartfordHartford, CT
401d$113,520 - $170,280

About The Position

The Hartford's RE&A Observability team is seeking a Senior Reliability Engineer to ensure the reliability of IT services with a focus on enhancing the developer experience. This role requires a strong problem-solving mindset and expertise in observability tools such as Splunk and Dynatrace. The engineer will be responsible for the design, build, and maintenance of services, aiming for service stability and effective software delivery while leveraging AI-driven insights for proactive issue resolution.

Requirements

  • Expertise in Splunk, Dynatrace, CDN, and other industry observability tools.
  • Strong problem-solving skills and innovative thinking.
  • Experience with AI-based systems is desired.
  • Hands-on experience with Performance and Observability tools such as Splunk ITSI, Dynatrace, CloudWatch, and CloudTrail.
  • Strong solution architecture orientation in a hybrid cloud environment.
  • Experience with continuous integration and DevOps methodologies, preferred tools such as GitHub, Jenkins, Nexus, Rally, and SonarQube.
  • Effective communication and collaboration skills.

Nice To Haves

  • Knowledge of complex traditional and modern enterprise architectures and systems.
  • Strong hybrid cloud experience across various service delivery models - SRE, IaaS, PaaS, SaaS.

Responsibilities

  • Ensure the reliability of IT services focused on the developer experience.
  • Design, build, test, deploy, change, and maintain services with a focus on service stability and effective software delivery.
  • Guide the use of best-in-class software engineering standards and design practices for instrumenting code and application technology stacks.
  • Enable the generation of relevant metrics on overall technology health, including availability, performance, quality, technical debt, and resiliency.
  • Function as the go-to technical expert for the applications supported, requiring depth and breadth of knowledge in Splunk ITSI and related technologies.
  • Provide expertise in applications, integration, interfaces, and the business domain to drive insights and improvements.
  • Leverage Splunk ITSI and Dynatrace Davis AI capabilities to enhance predictive analytics and automated incident response.
  • Enable alerting, monitoring, service intelligence, noise reduction, self-healing, dashboards, and overall insights using Splunk ITSI and Dynatrace.
  • Enhance the delivery flow by engineering solutions with Splunk ITSI and Dynatrace to increase delivery speed while adhering to technology standards.
  • Implement preventative controls and drive increased automation and self-healing capabilities using Splunk ITSI and Dynatrace.
  • Drive the triaging and service restoration of all high-impact incidents to minimize mean time to service restoration.
  • Partner with infrastructure teams to design and implement intelligent incident routing and automated service restoration processes.

Benefits

  • Short-term or annual bonuses
  • Long-term incentives
  • On-the-spot recognition

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Senior

Industry

Insurance Carriers and Related Activities

Education Level

No Education Listed

Number of Employees

10,001+ employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service