Sr. Site Reliability Engineer

Jerry.aiPalo Alto, CA
1d

About The Position

Jerry.ai is seeking a Sr. Site Reliability Engineer to take complete ownership of our infrastructure. We've achieved product-market fit and are in a rapid scaling phase, making this a critical moment for our team. We need a seasoned professional who can lead the charge in building, maintaining, and optimizing the core systems that power our business. In this role, you'll be the key driver of automation, scalability, and reliability across our entire infrastructure and CI/CD pipelines. This is a hands-on position where you'll be responsible for more than just keeping the lights on. You will be a strategic partner, designing and implementing solutions that enable our engineering teams to move faster and deliver high-quality products to our customers. Jerry.ai is building the first super app to make car ownership affordable and accessible – insurance, buy/sell, registration, loans, safety, repairs, parking, etc – a $2T market in the U.S. We started with insurance in 2019, and since then we’ve launched loan refinancing, driving insights, repair marketplace, car diagnostics, and a GenAI-powered chatbot & voicebot. We have amassed over 5M customers, raised $240MM in funding, scaled our revenue 60X and our team to 225 across 6 countries.

Requirements

  • B.S. degree in Computer Science or related discipline;
  • 5+ years of experience working with Amazon AWS services and tools, such as ECS, EKS, RDS, S3, ElasticCache, ELB, ECR, cdk etc;
  • Meticulous attention to detail to ensure systems are always up
  • Solid coding skills preferably in Python or Golang, familiarity with NodeJS and Typescript, and strong system analysis and problem solving skills;
  • Excellent teammate with strong communication, collaboration, and technical writing skills;
  • Highly motivated and thrive in dynamic and fast-paced environments with a real passion for ensuring scalable and highly available systems in the cloud

Responsibilities

  • Take full ownership of Jerry’s infrastructure including managing our cloud accounts end to end, ensuring robustness and security
  • Develop systems / components to improve operational simplicity and reliability of Jerry’s product systems, for example, monitoring system, deployment automation, and diagnosis tools etc;
  • Develop systems and tools to manage the security access and capacity planning of cloud resources;
  • Work with various product teams to guide systems architecture and offer suggestions for reliability and optimization of production systems;
  • Be the shepherd to coordinate Jerry’s production incident handling across teams

Benefits

  • health, dental, and vision coverage
  • paid time off
  • paid parental leave
  • 401(K) plan with employer matching
  • wellness benefits
  • equity grants
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service