Sr. Software Engineer, DevOps

AKASASan Francisco, CA
Hybrid

About The Position

AKASA is seeking a Sr. Infrastructure Engineer to join their Infrastructure and Platform teams. The role focuses on managing, improving, and scaling the systems that power AKASA's products, with an emphasis on reliability, observability, automation, operational excellence, and cross-functional collaboration. The engineer will help build and maintain foundational infrastructure for SaaS applications, including Kubernetes, Terraform-managed cloud resources, and GitHub-based CI/CD pipelines. While incident response is part of the role, the primary focus is on proactive improvements such as reducing operational toil, enhancing system visibility, and enabling product teams to operate with confidence. The role also involves contributing to monitoring and observability efforts to detect issues before they impact customers and collaborating with software engineers to embed reliability best practices. This position is ideal for individuals who enjoy a blend of hands-on engineering, systems design, automation, and technical mentorship. The role is based in South San Francisco, with a requirement to attend co-working days in the office every Wednesday.

Requirements

  • Experience with metrics, logs, and traces using tools such as Grafana, Prometheus/Mimir, OpenSearch, Sentry, or similar.
  • Proficient with Terraform, Kubernetes, and containerization tools.
  • 5+ years of experience with Python.
  • Comfortable working with Linux-based environments and writing shell scripts.
  • Strong collaboration skills with a focus on asynchronous, written communication.
  • Commitment to clear, comprehensive documentation and process standardization.
  • Self-starter mindset with a proactive approach to solving operational challenges.
  • Skilled in Git/GitHub-based workflows.

Nice To Haves

  • AWS (preferred), GCP, or Azure cloud infrastructure management.
  • Familiarity with TCP/IP, DNS, routing, and load balancing concepts.
  • Understanding of cloud and infrastructure security best practices.
  • Experience tuning application or infrastructure performance in production environments.

Responsibilities

  • Build, manage, and optimize infrastructure using Terraform, GitHub CI/CD, and Kubernetes.
  • Create visualizations and alerts that provide actionable insights using tools like Grafana, Prometheus/Mimir, OpenSearch, and Sentry.
  • Identify manual or error-prone processes and replace them with automated, repeatable systems.
  • Diagnose and resolve production issues across application and infrastructure layers.
  • Capture knowledge in runbooks, setup guides, and architecture diagrams to support operational maturity.
  • Partner with engineers across teams to drive adoption of DevOps and infrastructure best practices.
  • Help scale infrastructure and monitoring systems to meet growing demands.
  • Participate in an on-call rotation and support incident response processes as needed.

Benefits

  • Flexible paid time off (PTO)
  • Expansive coverage for health, dental, and vision
  • Employer contribution to Health Savings Accounts (HSA)
  • Generous parental leave policy
  • Full employee coverage for life insurance
  • Home office stipend
  • Cell phone/internet reimbursement
  • Company-paid holidays
  • 401(K) plan
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service