Site Reliability Engineer

Tamr | Cambridge, MA
2d | $110,000 - $150,000 | Onsite

About The Position

We're looking for a Site Reliability Engineer to join our team and help build and maintain the platform services that power our infrastructure. You'll work with Go and Python to develop tooling, improve CI/CD pipelines, and manage containerized applications on Kubernetes. This role offers significant growth opportunities: you'll learn advanced SRE practices while contributing meaningfully from day one.

Requirements

  • Bachelor's or Master's degree in Computer Science, Software Engineering, or a related technical field
  • Proficiency in Python and Go, including familiarity with best practices and common frameworks
  • Understanding of object-oriented and functional programming concepts
  • Experience with version control using Git and GitHub workflows (pull requests, code reviews, branching strategies)
  • Basic understanding of containerization concepts and Docker
  • Basic knowledge of Linux/Unix command line and shell scripting
  • Familiarity with Jenkins or other CI/CD tools
  • Hands-on experience with at least one major public cloud provider, such as Amazon Web Services (AWS), Google Cloud Platform (GCP), or Microsoft Azure
  • Familiarity with relational databases and SQL (PostgreSQL experience is a plus)
  • Understanding of RESTful APIs and microservices architecture concepts
  • Understanding of monitoring and logging tools

Nice To Haves

  • Hands-on experience with Google Cloud Platform services
  • Previous internship, co-op, or 1–2 years of professional software development experience
  • Experience with infrastructure-as-code concepts
  • Exposure to Kubernetes or container orchestration
  • Experience building Kubernetes Operators

Responsibilities

  • Develop and maintain platform services and tools in Go and Python
  • Build and improve CI/CD pipelines using Jenkins
  • Deploy and manage containerized applications on Kubernetes
  • Write and optimize database queries and schemas in PostgreSQL
  • Collaborate with senior engineers on infrastructure automation
  • Participate in code reviews and contribute to team best practices
  • Monitor and troubleshoot platform services
  • Document systems, processes, and runbooks
  • Participate in the on-call rotation