Staff Site Reliability Engineer

ClickUp
$200,000 - $242,500

About The Position

ClickUp is revolutionizing the way the world works. As the only all-in-one productivity platform built from day one for true convergence, ClickUp unifies tasks, docs, chat, calendar, enterprise search, and more, supercharged by context-driven AI. While others scramble to bundle fragmented tools or bolt on AI, we anticipated this future and made it our foundation from the start. Headquartered in San Diego with a rapidly expanding global footprint, we empower over three million teams to break free from silos and reclaim their time, saving at least one day every week. Join ClickUp, one of the fastest-growing SaaS companies on the planet, and help millions of users transform the way they work. We’re not just building software. We’re shaping the future of work. Come join us in building the future, together. 🦄

ClickUp is the world’s only all-in-one productivity platform that flexes to the way people want to work. It replaces all individual workplace productivity tools with a single, unified platform that includes project management, document collaboration, whiteboards, spreadsheets, and AI. With our headquarters based in San Diego and a rapidly expanding global presence, we are shaping the future of work. Join our team at ClickUp, one of the fastest-growing SaaS companies worldwide, and help millions of users be more productive, saving them at least one day every week. 🦄

We are looking for driven and innovative software engineers with a strong site reliability engineering (SRE) discipline, or an interest in this area, to help us make ClickUp the 'one app to rule them all'. As an SRE at ClickUp, your primary role will be improving the stability, availability, and reliability of the globally distributed, cloud-based infrastructure that powers our app for thousands of users daily. If you are a rockstar engineer with an entrepreneurial, fast-paced mindset who is ready to own, drive, and tackle some of the most complex problems out there, we would love to hear from you!

Requirements

  • 4-6+ years of experience with the Amazon Web Services ecosystem (EC2, ECS, VPC, Redis, RDS, ALB, ECR)
  • Experience working with Kubernetes
  • Experience managing production-critical infrastructure, with a DevOps mindset.
  • Familiarity with SRE best practices and procedures.
  • Experience with IaC (CDK, Terraform) and CI/CD (GitHub Actions, ArgoCD)
  • Familiarity with containerization (Docker)
  • Knowledgeable in network, firewall, and security best practices.
  • Experience with self-healing automation and monitoring tools (DataDog, CloudWatch)
  • Knowledge of relational databases, preferably PostgreSQL (not mandatory)
  • A strong, operationally focused self-starter and problem-solver.
  • Excellent interpersonal, written, and oral communication skills.

Nice To Haves

  • Experience with application security testing.
  • Familiarity or experience with Node.js.
  • Experience with management of Linux-based EC2 instances.

Responsibilities

  • Lead the design and build of systems for maximum performance, reliability, and scalability.
  • Serve as a lead in partnership with engineering teams on product design, decisions, and troubleshooting.
  • Improve overall stability and observability, including the metrics that track uptime and stability.
  • Champion our monitoring infrastructure.
  • Implement and improve our general site reliability posture (error and downtime budgets, MTTD and MTTR improvements, improving alerting and notifications, minimizing customer impact from incidents, etc.)
  • Respond to and troubleshoot downtime events while actively developing safeguards to prevent them.
  • Participate in brainstorming sessions with the engineering team and contribute ideas to our technology and algorithms.
  • Mentor members of the team to improve overall excellence.

Benefits

  • Equity
  • 401k
  • Health, Dental, and Vision insurance
  • Spending accounts
  • Life & Disability
  • Paid parental leave
  • Flexible paid time off
  • Enhanced employee assistance program
  • Employee wellness stipend
  • Professional development stipend