About The Position

Grafana Labs is a remote-first, open-source powerhouse. Grafana, the open source visualization tool, has more than 20M users around the globe, monitoring everything from beehives to climate change in the Alps. The instantly recognizable dashboards have been spotted everywhere from a NASA launch and Minecraft HQ to Wimbledon and the Tour de France. Grafana Labs also helps more than 3,000 companies, including Bloomberg, JPMorgan Chase, and eBay, manage their observability strategies with the Grafana LGTM Stack, which can be run fully managed with Grafana Cloud or self-managed with the Grafana Enterprise Stack, both featuring scalable metrics (Grafana Mimir), logs (Grafana Loki), and traces (Grafana Tempo).

We’re scaling fast and staying true to what makes us different: an open-source legacy, a global collaborative culture, and a passion for meaningful work. Our team thrives in an innovation-driven environment where transparency, autonomy, and trust fuel everything we do.

You may not meet every requirement, and that’s okay. If this role excites you, we’d love you to raise your hand for what could be a truly career-defining opportunity.

This is a remote position. We are looking for candidates in United States time zones.

What is Grafana Cloud?

Grafana Cloud is our composable observability platform that integrates metrics, logs, traces, and profiles with Grafana. It allows our customers to leverage the best open source observability software (including Prometheus, Mimir, Loki, Tempo, and Pyroscope) without the overhead of installing, maintaining, and scaling their own observability stack.

The Databases team owns and operates the telemetry databases: Mimir for metrics, Loki for logs, Tempo for traces, and Pyroscope for profiles. Our databases are offered as a hosted service in Grafana Cloud, and additionally as on-premise solutions with Grafana Enterprise Metrics, Grafana Enterprise Logs, and Grafana Enterprise Traces. They are multi-tenant distributed systems implemented in Go and operating at scale on Kubernetes across all major cloud service providers (AWS, GCP, Azure).

As a company we are remote-first and global. We embrace people of different experiences and backgrounds to build diverse teams where every person brings a new perspective to the software.

Mimir Squad

The Mimir squad has 3 sub-squads (Ingest, Storage, and Query), which together maintain the Mimir OSS project and additionally own and operate Grafana Cloud Metrics across 3 major cloud providers. Engineers on the team focus on optimizing the efficiency and resilience of processing, storing, and querying metrics at large volumes. These services operate at a large scale, and performance is key to keeping the offering competitive and running smoothly.

A Mimir engineer has various work streams. They are likely engaged in a larger project with another engineer, while also incorporating performance and reliability improvements discovered through operating the system in production. They are also responsible for writing PRs and design documents and reviewing those of other engineers in the squad, shepherding automated release rollouts, and participating in the on-call rotation for their systems.

What will you be doing?

Requirements

  • You are a motivated self-starter with a bias towards action. You are customer-focused: we build everything with our users in mind, and you have a passion for building intuitive products that fit customers’ needs. You have good time management skills, which you leverage to work on the right things at the right time.
  • Pragmatism: You have a bias towards action, taking direction and building a plan of action to analyze, design, and build modular solutions, deliver MVPs, gather data and feedback and then progress iteratively
  • Collaboration and communication: The smallest unit we have is a squad. You’ll be working with your teammates in a fully remote setup. Good communication and time management skills are a must
  • AI: Some experience using LLMs for day-to-day coding tasks and for understanding a codebase.
  • Experience with at least one programming language. We use Go, but if you have familiarity with Python, C, C++, Rust or similar then that translates well
  • Some experience with delivering projects as a member of a larger team. Your experience includes gathering requirements and brainstorming ideas, all the way to shipping features to the customer’s hands.
  • Some experience with developing software that runs in the Cloud or some experience with systems engineering
  • Some experience with being on-call and following the DevOps model
  • Experience writing clean, robust, and performant software that is easily maintained by others
  • Some familiarity with observability systems, including knowing when to use metrics, logs, or traces to debug a problem.

Nice To Haves

  • Experience working with Kubernetes
  • Experience working with queue systems, e.g. the Kafka protocol
  • Experience using Grafana and Prometheus in operational roles (including on-call for your team at a previous employer or just using these tools on hobby/homelab projects)
  • Exposure to microservices architecture and distributed systems, or a desire to learn
  • Familiarity with the concept of infrastructure as code

Responsibilities

  • Take an active role in influencing our roadmap and your own career objectives
  • Work with your team to deliver new features, then use the results to iterate and improve.
  • Help your team drive projects from initial idea all the way to operations once they are in the hands of customers
  • Embrace our open-source culture and contribute to other projects that may not directly fall within your team’s scope
  • Build, operate, and maintain critical systems, owning the reliability, performance, and availability
  • Be a part of your team’s follow-the-sun on-call rotations and take ownership of the services you’re running
  • Support other team members, participate in design discussions and collaborate with the team
  • Learn new skills by gaining a deeper understanding of our cloud product and our customers and getting to know the codebase of a large distributed system

Benefits

  • We invest heavily in developer productivity. You can use modern AI coding assistants as part of your daily workflow (your choice of tools, within security guidelines), backed by a company-funded usage budget so you can iterate quickly without unnecessary friction.
  • We encourage pragmatic AI-assisted development: faster prototyping, test generation, refactors, documentation, and incident follow-ups, always paired with strong code review and quality standards.
  • You’ll also have access to frontier models (e.g., GPT-Codex 5/3, Claude Opus 4.6, Gemini 3 Pro).
  • Equity
  • Bonus (if applicable)
  • Balance is Key - We operate a global annual leave policy of 30 days per annum. 3 days of your annual leave entitlement are reserved for Grafana Shutdown Days to allow the team to really disconnect. We will comply with local legislation where applicable.

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Education Level

No Education Listed

Number of Employees

501-1,000 employees
