SRE Lead

Nexthink, Boston, MA
$174,000 - $272,000, Hybrid

About The Position

Nexthink is looking for a Lead Site Reliability Engineer who is passionate about building and running a high-performance cloud platform and enabling best-in-class site reliability and operations practices. This role will support US-based operations in general and will also focus on enabling Nexthink to serve the US Public Sector market, in particular through a FedRAMP Moderate offering. The candidate will drive the development of modern, cloud-native SRE processes and lead the management and operation of Nexthink’s multi-tenant, microservices-based cloud platform, which has multiple instances deployed across the globe. The role involves working closely with cross-functional teams to integrate reliability and security into our systems and to ensure they meet federal security standards. The ideal candidate will have extensive experience in both software engineering and systems administration, with a strong understanding of FedRAMP concepts, requirements, and security practices.

Requirements

  • Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent experience).
  • 5+ years of experience in site reliability engineering, DevOps, or a related role, with at least 2 years in a leadership or managerial position.
  • Proficiency in cloud platforms (AWS, Azure, GCP) and cloud-native services.
  • Strong scripting and programming skills (Python, Bash, Go, or similar).
  • Experience with Infrastructure as Code (IaC) tools such as Terraform, Crossplane, CloudFormation, or Ansible.
  • Knowledge of containerization and orchestration (Docker, Kubernetes).
  • Familiarity with CI/CD pipelines and tools (Jenkins, GitLab, GitHub, etc.).
  • In-depth knowledge of FedRAMP requirements and best practices.
  • Experience with security tools and practices (SIEM, IDS/IPS, firewalls).
  • Understanding of network security, encryption, and secure software development practices.
  • Ability to collaborate and communicate effectively with global engineering teams in EU and India time zones.

Responsibilities

  • Lead, mentor, and develop a team of US-based Site Reliability Engineers.
  • Foster a culture of continuous improvement, collaboration, and innovation.
  • Oversee the design, deployment, and management of scalable and secure cloud infrastructure.
  • Drive automation of infrastructure provisioning, configuration, and management using Infrastructure as Code (IaC) tools.
  • Develop and maintain comprehensive monitoring, logging, and alerting systems to ensure high availability and performance.
  • Lead efforts in performance tuning and optimization for applications and infrastructure.
  • Ensure implementation and maintenance of security controls and best practices to achieve FedRAMP compliance.
  • Conduct and oversee regular security assessments, vulnerability scans, and penetration testing.
  • Collaborate with the compliance team to prepare for and respond to FedRAMP audits.
  • Lead incident management efforts, ensuring rapid resolution and thorough root cause analysis.
  • Develop and implement strategies for improving incident response and minimizing downtime.
  • Work closely with development, operations, and security teams to integrate reliability and security into the software development lifecycle.
  • Communicate effectively with stakeholders, providing regular updates on system performance, reliability, and compliance status.

Benefits

  • Flexible hours and unlimited vacation (unlimited paid time off on top of the 15 days of holidays we offer), plus 11 company-paid holidays and 3 extra days for volunteering.
  • Hybrid work model that balances office and remote work, with structured onboarding to foster connections and team integration.
  • Free access to professional training platforms to explore your interests and enhance your skills.
  • Up to 16 weeks of paid leave for birthing parents/primary caregivers, 6 weeks for secondary caregivers.
  • 401(k) plan with up to 4% company matching contributions, vesting immediately, to grow your retirement savings.
  • Bonuses for referring successful hires after three months of continuous employment.