Senior Software Engineer, Platform

GridmaticCupertino, CA
1d

About The Position

Gridmatic is a high-growth startup and a new kind of energy company, delivering affordable, clean power by optimizing renewable energy and grid-scale batteries. With offices in the Bay Area and Houston, we bring together Silicon Valley–style innovation with deep, hands-on expertise in real-world power markets and energy retail. As solar and wind become the fastest-growing sources of electricity, variability from weather and grid conditions makes energy prices more volatile. Gridmatic tackles this challenge with industry-leading forecasting and optimization—and gives our team the opportunity to work on problems that truly matter. Forecasting and trading energy are the foundation of what we do. We ingest large-scale data—weather, prices, load, and grid conditions—to build probabilistic machine learning forecasts that drive real operational decisions. Our work directly determines when power is bought, stored, or deployed, turning uncertainty into value for customers and the grid. Our impact is measurable. Gridmatic is the most profitable participant in ERCOT’s wholesale market and operates the top-performing battery asset in CAISO. Profitable without venture capital, we offer a collaborative, low-ego environment where rigorous thinking, autonomy, and continuous learning are core to how we work. We're looking for a Senior Software Engineer to join our Platform team and build the foundational infrastructure that powers Gridmatic. Our platform challenges are shaped by the nature of energy markets: forecasts and trading decisions run on tight schedules, battery dispatch commands must execute reliably in real time, and ML models need to train and deploy continuously as new data arrives.

Requirements

  • Significant experience building and operating production infrastructure on a public cloud platform (we run on GCP, but AWS or Azure experience translates well)
  • Hands-on experience with Kubernetes (we run on GKE and it's foundational to our platform)
  • Proficiency in Python
  • Experience with infrastructure-as-code tools like Terraform
  • Either already know Go or have experience with a similar systems language (C++, Java, Rust) and are excited to work in Python and Go day-to-day
  • Systems thinking—understanding how components interact, where failures can cascade, and how to build for efficiency, scalability, and resilience
  • Clear communication, whether writing a design doc, reviewing code, or explaining a complex system to someone new to it
  • Opinions about how to build reliable infrastructure, informed by experience with what works and what doesn't at scale

Nice To Haves

  • Background in ML infrastructure or training pipelines
  • Experience with workflow orchestration tools (Flyte, Temporal, Airflow, or similar)
  • Familiarity with observability tooling (we use Grafana + Google Cloud Monitoring)
  • Experience with real-time systems
  • Strong Linux fundamentals
  • Prior work in domains where latency and reliability have direct business consequences

Responsibilities

  • Design and build our compute platform, creating infrastructure that supports production services, batch jobs, and ML training workloads
  • Work on real-time systems for operating battery assets, where reliability directly impacts both revenue and grid stability
  • Establish patterns, tooling, and best practices that help teams across Gridmatic run services reliably
  • Make architectural decisions that shape how we build software as we grow
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service