Senior Software Engineer, Site Reliability

ZipRecruiterLos Angeles, CA
72d$140,000 - $200,000

About The Position

We are seeking a skilled software engineer with a focus on scalability, security, reliability, and extensibility. Our SRE team partners with product-centric engineering teams to provide infrastructure, tools, and architectural design to ensure the rapid and sustainable growth of all our services. Our team influences the thinking and methodologies of the entire engineering organization. A primary method for this is to provide smoothly paved paths toward good patterns, a few guard rails against particularly bad patterns, and a carefully minimized amount of toil. An excellent candidate will have the breadth of perspective to grasp both short term product feature goals and long term scalability goals, and the depth of technical skill to design systems that reconcile the two.

Requirements

  • 5+ years of relevant experience in industry
  • Experience architecting, developing, and troubleshooting large scale distributed systems
  • Fluency with multiple compiled or interpreted languages, ideally including Go
  • Experience with self-managed Kubernetes
  • Experience in the design and implementation of high-volume CI/CD pipelines
  • Strong low-level Linux skills; strace, netstat, and tcpdump should be old friends

Nice To Haves

  • Computer Science degree or relevant experience
  • Experience with AWS, and ideally with managing it with Terraform
  • Familiarity with large scale monitoring systems: Prometheus, Cloudwatch, Cloudtrail, etc

Responsibilities

  • Design, implementation, and troubleshooting of large fault-tolerant distributed systems
  • Design and implementation of a diverse ecology of tools and frameworks surrounding our Kubernetes clusters
  • Design, implementation, and continual refinement of a complex and performant CI/CD infrastructure
  • Embracing and enabling GitOps workflows throughout the organization
  • Participation in an on-call rotation covering core infrastructure
  • Practicing sustainable incident response and blameless postmortems

Benefits

  • Competitive compensation
  • Exceptional benefits package
  • Flexible Vacation & Paid Time Off
  • Employer-matched 401(k) plan
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service