Network Operations Analyst

Unity Technologies, San Francisco, CA
21d · Onsite

About The Position

Join us in San Francisco as a Network Operations Analyst (IC5) and help shape how we measure, learn, and improve network reliability across our products and ecosystem. You will be responsible for turning real-time signals into clear insights, ensuring healthy network performance, and enabling key decision makers with transparent reporting. You’ll collaborate closely with Network Insights BI to build monitoring tools, transform data into actionable results, and respond to incidents with a calm, thoughtful approach. If you are a forward thinker who loves to build, learn, and persevere through complex challenges, we want you to join us and elevate our network health reporting to new heights.

Requirements

  • Proven track record translating complex telemetry into clear, decision-ready insights.
  • Demonstrated ability to implement monitoring dashboards in Looker and Grafana, and to query streaming data platforms like Kafka and Imply.
  • Effective written and verbal communication, with precise incident updates and clear documentation.
  • Ability to complete incident runbooks and restore normal network state with measurable recovery targets.
  • Consistent on-time reporting with >99% data freshness and accuracy for defined key metrics.

Nice To Haves

  • Experience designing automated anomaly detection with statistically sound thresholds (Users First).
  • Skill in coordinating multi-functional incident bridges and post-incident reviews (In It Together).
  • A thoughtful, critical-thinking mindset that lets the best ideas win through experimentation and learning (Best Ideas Win).

Responsibilities

  • Monitor key network performance metrics daily across all products and surface meaningful anomalies in Looker, Imply, Kafka, Grafana, and related tools.
  • Track ecosystem partnership signal health and escalate deviations with clear, timely insights.
  • Partner with Network Insights BI to design and implement monitoring tooling and automated alerting that scales.
  • Initiate workstreams for incidents and fix issues by coordinating with multi-functional BI teams; support post-incident analysis and impact quantification.
  • Publish concise, accurate daily and weekly network health reports that inform key decision makers.

Benefits

  • We offer a wide range of benefits designed to support employees' well-being and work-life balance. You can read more about them on our career page.