Manager Site Reliability Engineering

DeepwatchTampa, FL
8hRemote

About The Position

Come join Deepwatch’s team of world-class cybersecurity professionals and the brightest minds in the industry. If you're ready to challenge yourself with work that matters, then this is the place for you. We're redefining cybersecurity as one of the fastest growing companies in the U.S. – and we have a blast doing it! Who We Are Deepwatch is the leader in managed security services, protecting organizations from ever-increasing cyber threats 24/7/365. Powered by Deepwatch’s cloud-based security operations platform, Deepwatch provides the industry’s fastest, most comprehensive detection and automated response to cyber threats together with tailored guidance from dedicated experts to mitigate risk and measurably improve security posture. Hundreds of organizations, from Fortune 100 to mid-sized enterprises, trust Deepwatch to protect their business. Our core values drive everything we do at Deepwatch, including our approach to tackling tough cyber challenges. We seek out tenacious individuals who are passionate about solving complex problems and protecting our customers. At Deepwatch, every decision, process, and hire is made with a focus on improving our cybersecurity solutions and delivering an exceptional experience for our customers. By embracing our values, we create a culture of excellence that is dedicated to empowering our team members to explore their potential, expand their skill sets, and achieve their career aspirations, which is supported by our unique annual professional development benefit. Deepwatch recognition includes: 2025, 2024, 2023, 2022 and 2021 Great Place to Work® Certified 2024 Military Times Best for Vets Employers 2024 US Department of Labor Hire Vets Gold Award 2024 Forbes' America's Best Startup Employers 2024 Cyber Defense Magazine, Global Infosec Awards 2023 and 2022 Fortress Cybersecurity Award 2023 $180M Series C investment from Springcoast Capital Partners, Splunk Ventures, and Vista Credit Partners of Vista Equity Partners 2022 Cybersecurity Excellence Award for MDR Manager, Site Reliability Engineering Reports to: VP, Product Engineering Lead the architecture, automation, and reliability of secure, scalable cloud infrastructure (AWS, GCP) and developer platforms within a cybersecurity context. Inspire DevOps excellence, deliver high availability, and drive operational resilience—all while mentoring a high-caliber SRE team. Lead and grow a small high caliber global SRE Team, managing US based engineers. Lead the architecture, automation, and reliability of secure, scalable cloud infrastructure (AWS, GCP) and developer platforms within a cybersecurity context. Inspire DevOps excellence, deliver high availability, and drive operational resilience—all while mentoring and managing a high-caliber SRE team.

Requirements

  • 8+ years in SRE, DevOps, or Platform Engineering; with technical leadership experience ready to step into management as a player/coach.
  • Proven cloud experience (AWS, GCP) and container orchestration (Kubernetes, Docker).
  • Strong coding/scripting (Python, GO) and proficiency in IaC and GitOps.
  • Deep knowledge of observability tools and defining reliability metrics.
  • Experienced in incident handling (PagerDuty, Datadog) and post-incident evaluations.
  • Demonstrated success in mentoring and developing junior/mid-level SRE talent, moving beyond delegation to hands-on technical coaching.
  • Familiarity with regulatory or cybersecurity frameworks (FedRAMP, NIST, STIGs, RMF).
  • Excellent cross-functional communication and stakeholder management.

Nice To Haves

  • certifications such as AWS, CKA, or cyber security credentials (e.g., OSCP).

Responsibilities

  • Lead and grow the SRE team, setting direction, mentoring and managing engineers, and fostering excellence.
  • Design and manage cloud and containerized infrastructure with IaC (Terraform).
  • Implement robust CI/CD pipelines integrating security and compliance.
  • Build scalable observability systems, leading the definition of SLIs / SLOs and dashboards.
  • Manage incident response, root cause analysis, and postmortems; automate recovery via playbooks/runbooks.
  • Drive capacity planning, performance tuning, and cost efficiency.
  • Collaborate with InfoSec, DevSecOps, and Compliance teams—ensuring alignment with frameworks like FedRAMP, NIST, RMF.
  • Support program-level initiatives, communicating effectively with stakeholders.
  • Promote a culture of reliability, security, and developer efficiency.
  • Maintain an active 'player' role, dedicating approximately 75% of your time to hands-on engineering (design, coding, and architecture) and 25% to leadership, mentorship, and management.

Benefits

  • Medical, dental, vision, and disability insurance
  • Flexible Time Off (FTO), 12 company holidays, sick leave and 8-Weeks Paid Parental Leave
  • Unique professional development benefits with Annual “development dollars” to support our people growth and development
  • Wellness contests and monthly educational programs
  • 401(K) retirement program
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service