This role focuses on responding to and resolving complex client and user operational issues, working with the Enterprise Applications team to define requirements and specifications, and designing/modifying solution strategies to meet SLA requirements. The position involves application configuration, extension, integration, and performance tuning, as well as handling escalated issues and improving standard operating procedures. A significant part of the role involves designing, implementing, and managing monitoring and observability solutions across various environments, configuring observability platforms, developing dashboards, setting up alerting, and monitoring performance against SLAs. The engineer will analyze telemetry data for trends and risks, support incident and problem management, integrate monitoring tools, reduce alert noise, support cloud migrations with observability, collaborate with DevOps/SRE teams, perform capacity planning, automate monitoring processes, maintain operational documentation, ensure compliance, and contribute to continuous improvement initiatives like AIOps and self-healing.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior