Site Reliability Engineering Manager, Consumer

Attain DataChicago, IL
54dHybrid

About The Position

As the Site Reliability Engineering Manager, you will manage our consumer SRE team in building out and maintaining the infrastructure and supporting tools that power all of our B2C applications, as well as ensure their uptime, stability and security. You will work closely with the leads of the other engineering and product teams at Attain in helping to architect our systems for security, observability, reliability and scalability. You will lead a team of hard working, driven and supportive SREs setting the direction, vision, and priorities for your team. You will work hands-on with our GCP, AWS and Kubernetes environments. You will be an owner within the engineering organization and be able to make a direct impact on the millions of users of our applications.

Requirements

  • Are a self-motivated leader who thrives on ownership and adaptability
  • Bring rigor and process to SRE while fostering collaboration and continuous learning
  • Are passionate about automation and hands-on infrastructure management
  • Value feedback and personal growth

Nice To Haves

  • 6+ years building and maintaining large-scale cloud-native infrastructure (AWS and/or GCP)
  • Experience leading SRE teams and cross-functional communication
  • Proven success managing maintenance and outages for large-scale consumer applications
  • Skilled in Kubernetes, Istio, Prometheus, and Argo
  • Proficient in SQL, event streaming, and pub/sub
  • Familiar with serverless technologies and infrastructure-as-code (Terraform)
  • Strong computer science and engineering fundamentals
  • Knowledge of SOC2 and PCI compliance

Responsibilities

  • Lead architecture and capacity planning discussions to ensure systems are scalable, reliable, and secure
  • Manage daily SRE operations, from ticket refinement and estimation through resolution and monitoring
  • Refine and document monitoring and alerting for B2C applications
  • Track and optimize consumer SLIs and SLOs
  • Introduce new processes and technologies to advance consumer infrastructure
  • Build and maintain platforms, CI/CD pipelines, networking, access controls, and infrastructure using Terraform
  • Develop Helm charts for Kubernetes deployments using Istio, Argo, and Prometheus
  • Monitor and maintain BigQuery, Spanner, Postgres, and MySQL databases

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Manager

Industry

Securities, Commodity Contracts, and Other Financial Investments and Related Activities

Education Level

No Education Listed

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service