Site Reliability Engineer

Jobgether
1d$100,000 - $120,000Remote

About The Position

This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Site Reliability Engineer in the United States. This role offers a unique opportunity to maintain and enhance the performance of large-scale, AI-powered operations across multiple facilities. You will ensure systems are reliable, resilient, and optimized while supporting hundreds of robotic systems. The position combines hands-on troubleshooting, infrastructure support, and software development to improve observability and automation. You will collaborate with cross-functional teams to resolve incidents, implement solutions, and create tools that increase operational efficiency. The environment is dynamic, fast-paced, and remote-friendly, with occasional travel to support on-site initiatives. This is an ideal role for someone passionate about technology, sustainability, and driving real-world impact through innovative solutions.

Requirements

  • Experience troubleshooting Linux systems and familiarity with containerized environments (Docker or similar).
  • Strong technical communication skills for collaborating with software teams and on-site personnel.
  • Interest in software development and understanding of coding standards, source control, build processes, and testing.
  • Ability to manage tasks independently within sprint-based or Kanban methodologies.
  • Strong interpersonal skills to handle high-pressure situations with industrial operations teams.
  • Passion for green technology and emissions reduction.

Nice To Haves

  • real-world experience with deployed hardware, reactive multitasking, and minimizing downtime.

Responsibilities

  • Provide first-line support for on-premises hardware, networking, operating systems, containers, and applications, including ticket triage and adherence to SLAs.
  • Participate in rotation for pager duty and escalation support for facility operations.
  • Develop and maintain observability, monitoring, alerting, and mitigation tools to proactively manage system reliability.
  • Enhance internal processes, documentation, and reporting for engineering support.
  • Translate lessons learned from incident response into tools and workflows that empower self-service across facilities.
  • Collaborate with cross-functional teams to troubleshoot issues, implement upgrades, and maintain system integrity.

Benefits

  • Competitive salary range: $100,000–$120,000 per year.
  • Eligibility for equity grants based on position and qualifications.
  • Medical, dental, and vision coverage with significant company contribution.
  • Life insurance and short/long-term disability coverage.
  • HSA-eligible health plans with company contributions.
  • 401(k) retirement plan.
  • Flexible time off, accrued sick days, and paid holidays.
  • Remote work flexibility with optional hybrid or in-office work at headquarters.
  • Opportunities for travel to support facilities and team collaboration.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service