Senior DevOps Engineer

Terawatt InfrastructureToronto, ON
$148,000 - $170,000Remote

About The Position

Join TeraWatt Terawatt Infrastructure is at the forefront of the transition to autonomous and electric vehicles, requiring significant investment in energy and charging infrastructure. The company specializes in delivering large-scale, turnkey charging solutions for businesses deploying AV and EV fleets. Terawatt develops, finances, owns, and operates charging solutions, simplifying the electrification of fleets. Their technology platform transforms sites into an intelligent network using advanced power management, operational optimization, and data-driven systems, offering hard performance guarantees. The company operates at the intersection of software, hardware, real estate, energy systems, and live site operations, with code directly impacting physical infrastructure and orchestrating megawatts of power. Role Description: As a DevOps Software Engineer at TeraWatt, you will contribute to the evolution of the platform supporting one of North America's leading fleet EV charging solutions. You will help develop and ensure the reliability of the charging network management system, enabling seamless charging and a high-quality customer experience. Working closely with the Director of Software and the Product team, you will expand the platform, deliver value in a dynamic industry, scale cloud infrastructure, and support organizational growth by implementing best practices for reliability, performance, and maintainability. This role is ideal for a DevOps or Cloud Infrastructure Software Engineer passionate about building scalable, impactful technology with a collaborative team, contributing directly to sustainable transportation infrastructure.

Requirements

  • 5+ years of experience building and operating high availability production software systems, preferably in DevOps or platform engineering teams.
  • Experience building and maintaining scalable cloud-based infrastructure, including services running in managed Kubernetes (EKS).
  • Experience building or maintaining CI/CD pipelines (e.g., GitHub Actions) to support reliable software delivery.
  • Experience leading or contributing to SRE or DevOps initiatives supporting production cloud platforms.
  • Experience with observability frameworks and tools (e.g., OpenTelemetry, Grafana, or similar platforms).
  • Experience working with managed databases such as PostgreSQL, MongoDB, or similar systems.
  • Strong communication skills and the ability to collaborate effectively across engineering, product, and infrastructure teams.

Nice To Haves

  • Experience working with multi-region AWS infrastructure and Kubernetes (EKS) at scale.
  • Experience improving security and compliance practices through automation and internal tooling.
  • Experience implementing or scaling observability standards using OpenTelemetry and tools like Grafana Cloud.
  • Experience maintaining or scaling working with data infrastructure, such as Databricks, Kafka (MSK), or similar streaming/data platforms.
  • Proficiency in Python or NodeJS.

Responsibilities

  • Contribute to the evolution of our cloud infrastructure using Terraform, supporting the build-out of resilient and scalable systems to support business growth.
  • Maintain helm charts and deployment patterns that enable teams to manage the lifecycle of their services while adhering to established deployment standards.
  • Build and maintain CI/CD pipelines using GitHub Actions to support engineering teams in owning their application deployment process.
  • Apply security best practices across all layers of the stack, including software access, managed workloads, and services running in pre-production and production environments.
  • Strengthen cloud and network security using industry-standard tools to detect vulnerabilities and anomalies, and help prevent suspicious or malicious activity.
  • Implement advanced observability practices using frameworks such as OpenTelemetry (OTel) and tools like Grafana Cloud for monitoring and alerting across services and infrastructure.
  • Develop tooling that supports both local and remote container-based cloud development workflows.
  • Create and automate simulated production scenarios used for testing during development and validating production releases.
  • Design and manage infrastructure that supports machine learning model training and deployment, ensuring scalable compute resources for ML workloads.
  • Partner with the Data team to manage core data infrastructure, including our Databricks data lake and Kafka event streams (Aiven/AWS) while advising on scalable data architecture and infrastructure improvements.
  • Contribute to building a highly available, web-based depot operations platform that supports the future of EV charging using NodeJS.
  • Participate in a 24/7 on-call rotation to support the reliability of production systems.

Benefits

  • Comprehensive benefits package
  • Performance-based incentives
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service