Data Center Analyst II

IREN
Vancouver, BC
Onsite

About The Position

The Data Center Analyst II supports the Integrated Operations Control function by providing timely customer and operational analytics that ensure system reliability and high service performance. Through monitoring, incident analysis, and reporting, this role transforms operational data into actionable insights that support customers, improve response effectiveness, and drive continuous improvement in data center operations.

Requirements

  • 3-4 years of experience in an operations support or analyst role, specifically focused on system maintenance and tool management within an IOC/NOC/SOC environment.
  • Deep proficiency in maintaining and optimizing IOC tools, including ITSM ticketing systems (e.g., ServiceNow, Jira) and monitoring platforms (e.g., Splunk, Datadog).
  • Proven ability to configure and tune alert thresholds, workflows, and automation rules for various infrastructure events (GPU cluster, network, facility).
  • Experience managing user access, permissions, and role-based controls across critical operational systems to ensure security and compliance.
  • Strong capability in generating operational metrics, incident summaries, and performance reports for executive review.
  • Demonstrated skill in managing the RCA documentation workflow, ensuring timeliness, quality, and accurate data retention.
  • Hands-on experience creating and updating operational documentation and training materials for IOC staff.
  • Technical knowledge of data integrity and compliance requirements for operational records and system logs.
  • Proactive approach to identifying tool gaps and workflow inefficiencies to support continuous improvement and automation initiatives.
  • Bachelor's degree in a technical field or equivalent experience in systems administration or operations support.

Responsibilities

  • Maintain and optimize IOC tools, including ticketing systems, monitoring platforms, dashboards, and alerting configurations.
  • Calculate, validate, and process Service Level Agreement (SLA) credits in accordance with contractual terms.
  • Manage user access, permissions, and role-based controls across all IOC systems to ensure security and compliance.
  • Configure and tune alert thresholds, workflows, and automation rules for GPU cluster, network, and facility events.
  • Generate operational metrics, incident summaries, and performance reports for leadership review.
  • Create, update, and organize RCA documentation; manage the RCA workflow to ensure timeliness, quality, and compliance with customer expectations.
  • Support onboarding and ongoing training of IOC staff by preparing documentation, guides, and tool-related resources.
  • Ensure data integrity, accuracy, and retention compliance across all IOC systems and operational records.
  • Assist with continuous improvement initiatives by identifying tool gaps, workflow inefficiencies, and opportunities for better monitoring or automation.

Benefits

  • RRSP matching program
  • Relocation assistance and support
  • Comprehensive extended health and dental coverage
  • Paid vacation
  • Professional development to support certifications, continuing education, or role-related training
  • Company events and team-building activities