Senior Cloud Infrastructure Engineer

Taskrabbit | San Francisco, CA
35d | $147,000 - $196,000 | Hybrid

About The Position

Taskrabbit is looking for an experienced Senior Cloud Infrastructure Engineer to drive the next phase of our Platform Modernization ("PlatMod") initiative. In this high-impact role, you won't just keep the lights on; you will build the next generation of infrastructure from scratch to support both Taskrabbit and Dolly. We are moving fast to decommission legacy tooling (i.e., Jenkins) in favor of a modern, cloud-native stack (Kubernetes, ArgoCD, Crossplane). You will be a hands-on owner, turning vague requirements into resilient, scalable solutions while advocating for your technical designs to the wider engineering organization. As a core member of a lean, collaborative team, you will also play a critical role in stabilizing our environment. You will participate in the on-call rotation, drive incident response, and implement the fixes necessary to keep our services highly available (99.9%) for our users worldwide.

Requirements

  • At least 5 years of experience in the infrastructure and DevOps space.
  • Experience with build automation and configuration management tools (e.g., Ansible, Puppet, Chef).
  • Strong knowledge of the Amazon Web Services (AWS) ecosystem and core AWS technologies such as Elasticsearch Service, RDS, WAF, and CloudFront, as well as Kubernetes.
  • Experience with common infrastructure tools such as Docker, Terraform, Helm, GitHub Actions, and ArgoCD.
  • Experience with a microservice architecture running in containers (Docker or other containerisation technology).
  • Experience supporting 24x7, high-availability internet application environments that include web, application, and database servers and load balancing systems.
  • Experience working with a product that has end-users.
  • Bachelor's degree or higher in Computer Science, or equivalent experience.
  • Excellent written and verbal communication skills.
  • A strong ownership attitude and a track record of taking responsibility for problems and pushing through to resolution.

Nice To Haves

  • AWS certification.
  • Software development background.
  • Experience in a startup environment.

Responsibilities

  • Build and maintain modern infrastructure such as Kubernetes and new CI/CD tools, and assist application teams in adopting it.
  • Build and maintain CI/CD pipelines from scratch for testing and releasing configuration and software.
  • Monitor and resolve issues in all environments using tools such as Datadog, PagerDuty, and AWS logs.
  • Engage in capacity planning and demand forecasting, anticipate performance bottlenecks, and scale the environment as needed using Datadog and other tools.
  • Design and implement zero-downtime solutions to achieve a highly available service (99.9% availability).
  • Ensure systems are secure against cyber threats and implement fixes for security vulnerabilities.
  • Automate tasks and develop tools to increase engineering efficiency and visibility.
  • Design and implement disaster recovery (DR) across regions in cloud providers such as AWS.
  • Manage web domains and certificates.
  • Troubleshoot production and testing environment issues, including performance and function issues.
  • Provide support to the organization through on-call, resolving issues and driving infrastructure changes.
  • Identify, define, and document system requirements and recommend solutions to management.
  • Perform on-call duties as part of the on-call rotation (weekends and evenings, as needed).

Benefits

  • employer-paid health insurance
  • 401k match with immediate vesting
  • generous and flexible time off with 2 company-wide closure weeks
  • Taskrabbit product stipends
  • wellness + productivity + education stipends
  • IKEA discounts
  • reproductive health support