Staff Site Reliability Engineer

Redwood Materials, Reno, NV

About The Position

Redwood is localizing a global battery supply chain that seamlessly integrates recovery, reuse, and recycling, keeping critical minerals in circulation and driving the energy transition. Founded in 2019, we’re delivering low-cost and large-scale energy storage and producing battery materials in the U.S. for the first time, all from batteries we already have.

Staff Site Reliability Engineer Essential Duties

We are seeking a highly skilled and motivated Staff Site Reliability Engineer to collect requirements, design & implement highly available systems & solutions, coordinate work across multiple teams, drive improvements to existing systems, introduce automation & integrations, and ensure appropriate monitoring & alerting is in place for rapid response. This role will collaborate on, assist with, and lead projects, and drive initiatives to ensure Redwood Materials has resilient systems in place to scale rapidly into a global enterprise.

Requirements

  • Bachelor’s degree in information technology or any related field.
  • 2+ years in an SRE-related role, and 5+ years in an IT systems-related role.
  • Experience administering IT infrastructure such as VMware, Active Directory, Windows Server, Linux, networking, cloud infrastructure (AWS, Azure), load balancing & monitoring.
  • Expertise in scripting, coding, automation, and integration with tools and technologies such as Python, Ansible, Chef, Puppet, REST, YAML, JSON, etc.
  • Self-motivated, hands-on mindset, with a willingness to contribute at all levels.

Nice To Haves

  • Experience working with SCADA, OT, MES, or other industrial software & systems.
  • Experience with DR playbooks, capacity modeling, and cost/performance optimization in hybrid environments.
  • A passion for sustainability and making the world a better place!

Responsibilities

  • Collect business & technical requirements and work with cross-functional teams to establish SLOs.
  • Design effective on-premises & hybrid systems & solutions with high availability & scalability, utilizing platform technologies including vSphere, Kubernetes, Linux, and Windows.
  • Coordinate work across IT, Software, Industrial Controls, Engineering & Business teams to implement complete systems & ensure business needs are met.
  • Identify opportunities to automate deployment & management of IT infrastructure & systems to reduce manual efforts and speed recovery.
  • Develop integrations that streamline use & visibility of data across components to deliver complete, efficient systems providing excellent utility & ease of use.
  • Support deployed systems by responding to incidents, leading fast triage, troubleshooting issues, and participating in an on-call rotation.
  • Lead post-incident reviews and drive improvements to eliminate repeat failure modes.