Optum is a global organization that delivers care, aided by technology, to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data, and resources they need to feel their best. Here, you will find talented peers, comprehensive benefits, and career development opportunities. Come make an impact on the communities we serve as you help us advance health optimization on a global scale. Join us to start Caring. Connecting. Growing together.

The OptumServe Enterprise Monitoring team is looking for an Observability Engineer. The team is responsible for enterprise infrastructure, application, and network monitoring across on-prem, hybrid, and various cloud environments. The selected candidate will join a team of skilled engineers with a broad background in enterprise monitoring and observability.

As an Observability Engineer, you will focus on maintaining the reliability, scalability, and availability of our log management solution and of our metrics and observability platform, which makes heavy use of automation (Terraform, Ansible, and scripts). The role also requires maintaining the performance KPIs of our solutions and defining their SLOs.

Primary Responsibilities:
Maintain and deploy monitoring and alerting
Design, configure, and maintain a log aggregation solution at large scale
Set up and manage ingestion pipelines and data transformations
Have the mindset of "automate any task"
Monitoring and Alerting: Build and maintain robust monitoring systems using tools like ELK, Dynatrace, Prometheus, OTEL, and Grafana to detect potential issues early and trigger alerts for timely response
Maintain associated documentation as it applies to our audit and certification requirements
Participate in troubleshooting, capacity planning, and performance analysis activities
Research new monitoring requirements and, in many cases, write code to meet them
Intermediate to expert skill in setting up AI rules for tools like Davis AI (Dynatrace) and/or Elastic GenAI
Solid expertise in setting up monitoring policies, rules, and templates, and in writing scripts to accomplish monitoring requirements

You'll be rewarded and recognized for your performance in an environment that will challenge you and give you clear direction on what it takes to succeed in your role, as well as provide development for other roles you may be interested in.
Job Type
Full-time
Career Level
Mid Level
Number of Employees
5,001-10,000 employees