Platform Engineer II (Observability)

Iterable
Posted 69d ago
$114,000 - $188,000

About The Position

At Iterable, the Observability team enables engineering teams to measure, diagnose, and improve system health. We own and evolve Iterable’s monitoring, logging, tracing, and metrics platforms—turning raw telemetry into actionable insight. As a Platform Engineer II – Observability on our tight-knit team, you’ll drive reliability by implementing modern monitoring, automation, and orchestration practices that keep our systems performing at their best.

Requirements

  • 2+ years of professional software, infrastructure, or SRE experience.
  • Hands-on work with Kubernetes (and Docker) in production.
  • Deep experience with at least one cloud provider (AWS preferred) and Infrastructure-as-Code (Terraform, Helm, GitOps).
  • Strong programming/scripting skills in Python, Go, or similar.
  • Experience using or supporting observability platforms (Datadog, Prometheus, Elastic, OpenTelemetry, etc.) in a production environment.
  • Familiarity with CI/CD pipelines and modern DevOps practices.
  • A growth mindset, humility, and a desire to elevate those around you.
  • Bachelor’s degree in CS/Engineering — or the equivalent real-world experience.

Nice To Haves

  • Built or run OpenTelemetry Collectors at scale.
  • Operated large K8s clusters or written controllers/operators.
  • Experience with GitOps.
  • Designed and executed observability cost optimization initiatives.
  • Experience in distributed tracing and high-cardinality metrics strategies.

Responsibilities

  • Own the full observability stack (Datadog, Prometheus, Grafana, Elasticsearch, Quickwit, OpenTelemetry)—design, deploy, and scale it to support petabyte-scale telemetry.
  • Instrument and automate monitoring, logging, tracing, and metrics to ensure system visibility across 100+ services and multiple Kubernetes clusters.
  • Ship platform features—contribute code that boosts reliability, performance, and developer experience across Iterable.
  • Partner with engineering teams to improve instrumentation, refine dashboards/alerts, and embed observability into their SDLC.
  • Reduce MTTR & cost—design cost-effective telemetry pipelines and create high-signal, low-noise alerting strategies.
  • Participate in our on-call rotation that prioritizes recovery, postmortems, and continuous improvement.

Benefits

  • Paid parental leave
  • Competitive salaries, meaningful equity, & 401(k) plan
  • Medical, dental, vision, & life insurance
  • Balance Days (additional paid holidays)
  • Fertility & Adoption Assistance
  • Paid Sabbatical
  • Flexible PTO
  • Monthly Employee Wellness allowance
  • Monthly Professional Development allowance
  • Pre-tax commuter benefits
  • Complete laptop workstation

What This Job Offers

Job Type: Full-time
Career Level: Mid Level
Education Level: Bachelor's degree
Number of Employees: 501-1,000 employees
