Director, Site Reliability Engineering (SRE)

Learning Ally Inc, Princeton, NJ

About The Position

As the Director of SRE & Security for Learning Ally, you will lead and contribute to a DevOps and Cybersecurity team that provides platform engineering, site reliability, and performance analysis to our technology department. You will work alongside product engineering and other technology leaders, develop infrastructure roadmaps, and assist in the architecture and development of foundational learning, teaching, rostering, and administrative systems to support our growing portfolio of products focused on delivering value to educators and students.

Requirements

  • 10+ years of hands-on experience in software engineering, operations, cloud operations, and solution architecture, including 2+ years leading a team.
  • Strong experience and technical knowledge of cloud development (AWS, Azure, GCP).
  • Ability to work efficiently with large data sets, logs, and analytics (Datadog, Power BI, BigQuery).
  • Experience with Terraform, CloudFormation, Kubernetes, Docker, Ansible/Chef, or similar technologies.
  • Experience building and maintaining GitHub Actions / Jenkins CI and blue/green deployment pipelines.
  • Experience working with relational and non-relational databases such as MySQL, MSSQL, PostgreSQL, and MongoDB.
  • Demonstrated ability to build 'one-click' deployments.
  • Experience deploying applications in Unix environments, and familiarity with MS Windows IIS / SQL Server environments for legacy maintenance and migration.
  • Security-focused, with a view that security is embedded in the SDLC and in each release.
  • Ability to work in the US.

Nice To Haves

  • Education Technology experience preferred.

Responsibilities

  • Lead and manage engineering teams, contracted engineers, and subprocessing vendors to optimize performance and ensure high-quality output.
  • Build auto-scaling, self-healing solutions using automation and infrastructure as code (Terraform, Kubernetes, CloudFormation, autoscaling groups).
  • Design and develop a vision for service-oriented architecture in close concert with our engineering team (across Windows and Linux-based environments).
  • Streamline software development by implementing reusable tooling for CI/CD pipelines and automating manual processes and associated developer and cybersecurity tooling (linting, SCAT/DCAT, unit testing, patch management, static analysis).
  • Identify network and application performance bottlenecks and security weaknesses, then propose and implement solutions.
  • Maintain and continually improve an ISMS for regular ISO 27001 certification audits, and assist the CISO in oversight of the Vanta-based compliance and cybersecurity program.
  • Leverage best-in-industry practices to support web applications and data platforms.