About The Position

As a Senior DevOps Engineer, you will design, build, and maintain the cloud infrastructure powering SkyFi's Earth Observation platform. You will work at the intersection of satellite technology and modern cloud-native systems, operating across GCP and AWS, managing production Kubernetes clusters, and championing GitOps-driven delivery. This role requires deep expertise in infrastructure-as-code, CI/CD, and site reliability practices, along with working proficiency in Python for automation and operational tooling. The ideal candidate thrives with minimal supervision, excels at tackling ambiguous, high-impact problems, and is eager to learn about the Earth Observation industry.

Requirements

  • Active U.S. security clearance (required).
  • U.S. citizenship (required).
  • 6+ years of professional experience in DevOps, SRE, or Platform Engineering.
  • 5+ years of hands-on experience operating and managing Kubernetes in production environments.
  • Strong hands-on experience with both GCP and AWS.
  • Proficiency with Terraform and Terragrunt for infrastructure provisioning and management.
  • Hands-on experience with Flux CD for GitOps-based continuous delivery.
  • Hands-on experience building and maintaining CI/CD pipelines with GitHub Actions.
  • Strong scripting skills in Bash and/or Python.
  • Solid experience with Docker and container orchestration.
  • Deep understanding of modern DevOps principles, cloud-native architecture, and infrastructure-as-code practices.
  • Solid understanding of, and experience with, observability systems such as Grafana and Prometheus.
  • Strong Linux systems administration skills.
  • Proactivity and the ability to work with minimal supervision.

Nice To Haves

  • Familiarity with service mesh technologies (e.g., Istio, Linkerd).
  • Previous experience supporting 24/7/365 production services.
  • Experience working in early-stage or high-growth startup environments.
  • Excellent organizational and documentation skills.

Responsibilities

  • Design, deploy, and maintain production Kubernetes clusters; own cluster lifecycle management, performance tuning, and capacity planning.
  • Build and manage cloud infrastructure across GCP and AWS using Terraform and Terragrunt, following infrastructure-as-code best practices.
  • Develop, optimize, and maintain CI/CD pipelines using GitHub Actions and Flux CD to enable reliable, GitOps-driven deployments of containerized applications.
  • Develop Python-based tooling and automation to support infrastructure and platform operations.
  • Troubleshoot and resolve operational, networking, pipeline, and infrastructure issues across multi-cloud environments.
  • Identify, document, and automate repetitive or critical workflows to reduce operational burden on the engineering team.
  • Implement and maintain comprehensive monitoring, alerting, and observability using tools such as Prometheus and Grafana.
  • Ensure compliance with security, governance, and regulatory requirements, including those tied to classified environments.
  • Collaborate with development and operations teams to gather requirements and translate them into reliable infrastructure solutions.
  • Partner with fellow engineers to architect, develop, and scale the product while keeping operational reliability and cost-efficiency in mind.
  • Champion cloud-native best practices, infrastructure-as-code principles, and GitOps workflows across the engineering organization.

Benefits

  • Be well compensated, with the possibility of equity.
  • Receive best-in-class benefits, including premium medical, dental, and vision coverage, and 20 days of paid time off.
  • Play a critical role in building a market-changing product in the exciting realm of space.
  • Thrive in a fast-paced, dynamic environment that rewards initiative, innovation, and getting things done.