ACL Digital - posted 3 months ago
Senior
Mountain View, CA
1,001-5,000 employees
Professional, Scientific, and Technical Services

We are seeking a Senior Site Reliability Engineer (SRE) to join our team in Mountain View, CA. This position is a 6-month contract-to-hire role focused on ensuring the reliability and performance of our QuickBooks infrastructure, which supports over 10 million active users. The ideal candidate will have extensive experience in Site Reliability Engineering and a strong background in automation and cloud services.

  • Design, develop, and maintain automation frameworks for performance testing and monitoring of QuickBooks infrastructure.
  • Ensure the scalability and reliability of services supporting 10M+ active users.
  • Build and optimize tooling using Python to automate deployment, monitoring, and operational tasks.
  • Work with AWS cloud services to architect resilient and efficient infrastructure.
  • Partner with developers, QA, and operations teams to embed SRE best practices into the product lifecycle.
  • Monitor, troubleshoot, and improve system performance, and participate in on-call rotations.
  • 5+ years of experience in Site Reliability Engineering, DevOps, or related roles.
  • Strong hands-on expertise with AWS (EC2, S3, RDS, Lambda, etc.).
  • Proficiency in Python for scripting and automation.
  • Experience with CI/CD pipelines, containerization (Docker, Kubernetes), and observability tools (Prometheus, Grafana, Datadog, etc.).
  • Proven ability to troubleshoot complex distributed systems.
  • Excellent collaboration and communication skills.