Lead Site Reliability Engineer

Turion Space
Irvine, CA
$155,000 - $231,000

About The Position

Turion is looking for a Lead Site Reliability Engineer to help build monitoring and reliability systems that keep satellites connected to Earth. You'll build the observability infrastructure that ensures our space communications systems operate 24/7 for customers ranging from commercial satellite operators to national security missions. This is a high-growth role where you'll evolve from building core monitoring systems to potentially leading teams and architecting global-scale reliability platforms. You'll work directly with our platform engineering team to establish the monitoring, alerting, and deployment practices that will scale with us from startup to enterprise. If you're excited about space technology and want to build infrastructure that directly supports mission-critical satellite operations, this role offers that opportunity.

Requirements

  • 5+ years of relevant hands-on experience in production operations and 1-2+ years in a technical leadership role
  • Ability to work across multiple engineering disciplines and with diverse teams, communicating clearly and operating with minimal oversight
  • Experience with observability tools (Grafana, Prometheus, Loki, Alloy, ELK) in production environments
  • Hands-on experience with DR planning, failure mode analysis, and building resilient systems with automated failover and recovery
  • Familiarity with HashiCorp Vault, Okta, or similar identity/secrets management systems
  • Previous experience scaling infrastructure at high-growth companies (startup to 100+ employees)
  • Linux system administration experience and networking fundamentals
  • Strong experience with Kubernetes, Docker, and container orchestration in production environments
  • Hands-on experience with CI/CD tools and infrastructure as code (Terraform or Crossplane preferred)
  • AWS experience with multi-service deployments and programming skills for automation (Bash, Python)
  • Self-directed work style with ability to own projects from conception to production in fast-moving environments
  • Understanding of SRE principles, SLOs/SLIs, and systematic approaches to system reliability

Nice To Haves

  • Demonstrated success in executing large projects on tight timelines
  • AWS certification or demonstrated expertise with advanced cloud networking and security
  • Interest in aerospace, telecommunications, or mission-critical systems
  • Active Secret or TS/SCI clearance that can be maintained

Benefits

  • Equity: Receive equity in Turion Space, letting you benefit from the company's success.
  • Health Insurance: Comprehensive medical, dental, and vision coverage for employees and their dependents.
  • Retirement Plans: Access to a 401(k) plan to help you plan for your future.
  • Paid Time Off: Generous vacation days, personal days, sick days, and holidays to ensure you have time to recharge.
  • Professional Development: Opportunities for ongoing training, workshops, and courses to advance your skills and career growth.
  • Team Building Activities: Regular social events, team outings, and company-sponsored activities to foster a positive work environment.