Responsible for responding to all major systems and service incidents during business hours and extended business hours in support of IT Incident Management program. Creates and maintains unified monitoring of Infrastructure, Application and Business & IT services to proactively detect, predict and prevent service, application and security problems. Monitors security and network operations in a 24x7 environment and escalates exceptions based on established procedures. Participates in on-call rotation supporting production systems. Performs initial triage, correlation, and documentation of security, availability, and service incidents. Investigates alerts using standard tools and predefined queries; escalates incidents requiring advanced analysis or coordination. Executes established incident response and availability playbooks for repeatable events. Maintains accurate incident records and provides status updates to stakeholders during the incident lifecycle. Utilizes and maintains monitoring dashboards and alert views (e.g., ServiceNow, Splunk, Orion, Tenable, AppDynamics, Sentinel). Uses prebuilt dashboards and analytics to identify potential issues (service degradation, security events, insider risk indicators). Follows established monitoring rules and procedures to support proactive fault detection and reduce alert noise. Coordinates with internal teams and vendors for resolution of assigned incidents. Tracks SLA adherence and ensures data quality for reporting and KPI tracking. Maintains working knowledge of tools, processes, and incident response best practices.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Entry Level