Under the direction of an Information Technology Specialist 3 within the Office of Information Technology Services - Dedicated Support Team/ Tax. Specific duties include but are not limited to: Monitoring the health and performance of OpenShift clusters, containers, nodes, and applications using tools such as Instana, OpenShift alerts, Prometheus, and Grafana. Deploying and managing Instana agents for comprehensive application performance monitoring and tracing. Designing and maintaining Grafana dashboards that leverage data from Prometheus and OpenShift to provide actionable insights. Assist with the configuration and troubleshooting of a centralized logging solution using Elasticsearch, Logstash or Beats, and Kibana. Evaluating workload criticality and understanding the impact of disruptions or performance degradation is essential for maintaining system reliability. Proactive configuration changes to minimize operational disruptions, along with the creation and fine-tuning of real-time alerts, thresholds, and escalation rules to detect anomalies and incidents early. Collaboration with development and DevOps teams is important to improve visibility across systems and services. Responding to alerts to ensure timely resolution of incidents, ensuring that monitoring tools remain up-to-date and compliant with organizational policies, and providing regular reports on system health and performance trends. Keeping support documentation current is expected.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Industry
Executive, Legislative, and Other General Government Support
Number of Employees
251-500 employees