ITOps / Observability Engineer

The Ohio State University
18h

About The Position

The IT Operations and Observability Engineer (NOC) is responsible for monitoring, maintaining, and supporting the availability and performance of enterprise IT systems in a NOC-style operational environment. This role focuses on real-time systems monitoring, alert response, incident triage, and escalation using enterprise observability and monitoring tools. The position requires strong troubleshooting skills, attention to detail, and the ability to follow operational procedures to ensure stable, reliable system operations.

Requirements

  • Bachelors degree or equivalent combination of education and experience.
  • Candidates must have at least 4 years of experience in IT operations, NOC, or systems monitoring roles with hands-on exposure to enterprise monitoring and observability platforms such as SCOM, Splunk, SolarWinds, and cloud-native monitoring solutions (e.g., Azure Monitor, AWS CloudWatch, or equivalent).
  • Experience with incident triage, alert response, ticket-based workflows, and escalation procedures is required, along with strong troubleshooting, documentation, and communication skills.
  • Familiarity with Linux and Windows operating environments and distributed system monitoring is essential.

Nice To Haves

  • Preferred candidates will have experience in a 24/7 or shift-based NOC environment, familiarity with ITIL-aligned incident and problem management practices, experience tuning alerts and dashboards, and strong analytical skills using tools such as Excel or similar reporting platforms.
  • Experience with IT Operations / Systems Administration.
  • Experience supporting collaboration tools and integrating monitoring systems with alerting and incident response workflows is a plus.

Responsibilities

  • Monitors enterprise systems, networks, and services using tools such as SCOM, Splunk, SolarWinds, and cloud-based monitoring platforms, responding to alerts, anomalies, and service degradations in accordance with established operational procedures.
  • Triages incidents, opens and manages tickets using structured workflows (ServiceNow or Jira-equivalent systems), performs initial troubleshooting and impact assessment, and escalates issues to appropriate engineering or infrastructure teams as needed.
  • Maintains real-time operational dashboards, validates alert accuracy, and assists in tuning thresholds and notifications to reduce noise and improve signal quality.
  • Documents incidents, troubleshooting steps, handoffs, and resolutions to ensure continuity across shifts and contributes to the creation and maintenance of standard operating procedures, runbooks, and knowledge base articles.
  • Supports Linux and Windows environments, monitors distributed and network-connected systems in hybrid and cloud environments, validates data integrity within monitoring and reporting systems, and assists with asset tracking and environment awareness.
  • Provides operational support for internal users and technical teams while adhering to security, compliance, and change management requirements.

Benefits

  • Eligible Ohio State employees receive comprehensive benefits packages, including medical, dental and vision insurance, tuition assistance for employees and their dependents, and state or alternative retirement options with competitive employer contributions.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service