Staff Site Reliability Engineer

Sunrun, Lehi, UT
Remote

About The Position

Ever since we started in 2007, Sunrun has been at the forefront of connecting people to the cleanest energy on Earth. It’s why we’ve become the #1 home solar and battery company in America. Today, we’re on a mission to change the way the world interacts with energy, and we’re building a company and brand that puts power at the center of life. And we’re doing it by designing a dynamic culture where employee development, well-being, and safety come first.

We’re unlike any other solar company. Our vertically integrated model gives us total control over every part of the energy lifecycle – from sale through installation and beyond – so you can find endless opportunities for growth. Come join a career you can grow in and a culture you can run with.

This position is primarily remote, with occasional visits to a local office or our corporate headquarters for team-building, training, and collaborative project work. These on-site sessions are designed to strengthen connections, share insights, and ensure a seamless experience for our team and customers. Equipment pick-up from a local branch will be required. We will provide advance notice whenever on-site attendance is required, making these times purposeful and rewarding.

Who We Are

Sunrun is on a mission to make solar energy affordable for more people. We help people upgrade their homes to solar energy without the big upfront costs. Sunrun is a dedicated residential solar company with a mission to bring clean solar power service to the masses.

This position reports to our Lehi, UT office. May telecommute. Salary offered: $242,050/year.

Requirements

  • Bachelor’s degree in Computer Information Systems, Software Engineering, or a closely related field
  • 5 years of experience as a Software Developer, including: microservices hosted in Azure; virtualization and cloud computing; Object-Oriented Design (OOD) and Object-Oriented Programming (OOP); building software solutions in an engineering environment using Python and shell scripting; and network analysis, debugging, and troubleshooting with Wireshark and Git

Responsibilities

  • Infrastructure Leadership: Provide strategic leadership in designing, implementing, and managing the overall infrastructure strategy for our organization.
  • Cloud Technologies: Leverage cloud platforms (e.g., AWS, Azure) to design, deploy, and manage scalable infrastructure solutions.
  • Define and Elevate Monitoring Standards: Spearhead the definition of advanced monitoring requirements and elevate SLAs. Collaborate with the engineering team and TPM to implement and enhance monitoring practices.
  • Exceptional Communication Skills: Expertly convey intricate technical information to diverse stakeholders with clarity and precision.
  • Leadership in SRE Principles and System Design: Provide leadership in integrating advanced SRE principles into applications and services. Lead the implementation of sophisticated system design measures for heightened security, performance, and resiliency.
  • Strategic Notification Strategies and Incident Response: Develop notification strategies for production outages. Leverage SLOs and SLIs to measure and optimize availability, latency, and response time. Lead and strategize emergency response efforts, conduct retrospectives with root cause analysis (RCA), and manage on-call workloads effectively.
  • Holistic Production Environment Oversight: Oversee the holistic health of the production environment, emphasizing availability and proactive monitoring. Drive advanced practices in application performance, capacity testing, and auto-scaling.
  • Innovative Support and Release Strategies: Spearhead innovative support and release strategies in collaboration with cross-functional teams. Lead initiatives to elevate services through advanced testing and release procedures.
  • Exemplary Documentation and Automation Practices: Champion exemplary documentation practices for actions, findings, and automation procedures. Identify and lead initiatives for advanced automation solutions.
  • Strategic Influence on Product Roadmap: Collaborate closely with engineering and product counterparts to strategically influence improved resiliency and reliability. Identify and lead major projects for substantial enhancements in reliability, cost savings, and revenue.
  • Strategic Efficiency and Capacity Planning: Drive strategic efforts in efficiency and capacity planning. Establish and communicate clear requirements while optimizing system resource usage.

Benefits

  • Medical/Dental/Vision Insurance
  • Life Insurance
  • Disability Insurance
  • 401k Plan + Company Match
  • Stock Purchase Plan
  • Paid Vacations/Holidays
  • Paid Baby Bonding Leave
  • Employee Discounts
  • PowerU - 100% Funded Education Programs
  • Employee Donation Matching
  • Volunteer Hour Rewards