About The Position

Grafana Labs is a remote-first, open-source powerhouse with over 20 million users of Grafana, its open-source visualization tool. The company helps more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack, which includes scalable metrics (Grafana Mimir), logs (Grafana Loki), and traces (Grafana Tempo), available fully managed with Grafana Cloud or self-managed with Grafana Enterprise Stack. Grafana Labs is rapidly scaling while maintaining its open-source legacy, global collaborative culture, and passion for meaningful work, fostering an innovation-driven environment built on transparency, autonomy, and trust. The Observability team at Grafana builds end-to-end observability solutions covering application, infrastructure, database, browser, and mobile. The Real User Monitoring (RUM) initiative specifically focuses on capturing, storing, and querying high-volume user interaction data from browsers and mobile devices to provide insights into real-world user experiences at scale. This involves building systems to ingest and process massive amounts of telemetry (sessions, events, traces, logs) for real-time exploration, requiring expertise in high-performance backend systems, columnar storage, and intuitive frontend experiences. Solutions are integrated with OpenTelemetry and Grafana Cloud, prioritizing performance, cost-efficiency, and developer experience across the entire stack, from instrumentation to visualization. The team values open standards, excellent developer experience, and robust engineering for reliable, scalable software.

Requirements

  • 5+ years of fullstack engineering experience with strong backend fundamentals
  • Backend experience (Go is preferred) and frontend experience, we use TypeScript and React
  • Experience building or operating distributed systems in production (e.g., Kafka, WarpStream, ClickHouse, Cassandra, Postgres)
  • Familiarity with cloud-native systems (Docker, Kubernetes, AWS, GCP, Azure)
  • Experience working with high-throughput, high-cardinality data (logs, metrics, traces, events)
  • Strong understanding of data modeling, query optimization, and performance tradeoffs
  • Experience designing and building APIs and distributed services
  • Experience building data-heavy UIs (dashboards, query tools, debugging interfaces)
  • Familiarity with observability concepts (traces, logs, metrics) and/or OpenTelemetry
  • Strong communication skills and ability to work in a remote, distributed team
  • Pragmatic, self-driven, and comfortable navigating ambiguity
  • Customer-focused mindset with a passion for developer experience

Nice To Haves

  • Experience with browser or mobile instrumentation (RUM SDKs, telemetry collection).
  • Mobile development experience (iOS or Android) or familiarity with mobile performance and telemetry.
  • Contributions to OpenTelemetry or other observability OSS.
  • Experience building developer-facing platforms or observability products.
  • Familiarity with session replay, sampling strategies, or user behavior analytics systems.

Responsibilities

  • Build and evolve fullstack features for RUM, spanning backend services, APIs, storage systems, and frontend user experiences.
  • Design and implement systems that ingest, store, and query high-cardinality, high-volume telemetry data using columnar/analytical databases.
  • Develop performant query layers and APIs that power real-time exploration of user sessions, traces, and events.
  • Contribute to frontend applications that visualize RUM data, enabling users to debug performance issues and understand user behavior.
  • Work on data modeling, indexing strategies, and query optimization to ensure low-latency, cost-efficient analytics at scale.
  • Collaborate closely with SDK engineers (browser and mobile) to ensure high-quality data ingestion and schema evolution.
  • Own projects end-to-end: from design and implementation to deployment, monitoring, and iteration.
  • Break down complex, ambiguous problems into incremental deliverables and iterate quickly based on feedback.
  • Ensure quality through testing, observability of your own systems, documentation, and smooth upgrade paths.
  • Collaborate cross-functionally with backend, frontend, product, and solutions engineering to deliver cohesive observability workflows.
  • Support teammates, participate in technical design discussions and help shape the RUM roadmap.

Benefits

  • 100% Remote, Global Culture
  • Scaling Organization
  • Transparent Communication
  • Innovation-Driven
  • Open Source Roots
  • Empowered Teams
  • Career Growth Pathways
  • Approachable Leadership
  • Passionate People
  • In-Person onboarding
  • global annual leave policy of 30 days per annum
  • 3 days of your annual leave entitlement are reserved for Grafana Shutdown Days to allow the team to really disconnect
  • All of our roles include Restricted Stock Units (RSUs), giving every team member ownership in Grafana Labs' success
  • modern AI coding assistants as part of your daily workflow (your choice of tools, within security guidelines), backed by a company-funded usage budget
  • access to frontier models (e.g., GPT-Codex 5/3, Claude Opus 4.6, Gemini 3 Pro)

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Senior

Education Level

No Education Listed

Number of Employees

501-1,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service