Senior System Administrator

LawZeroMontreal, QC
Onsite

About The Position

In charge of technology, the IT department oversees CyberSecurity, End User support, and the management of the compute environment used to achieve our mission. We are looking for a Senior System Administrator, part of an agile team reporting to the IT Director, focused on the administration of our on-premise and cloud computing environments, supporting our mission to build safe-by-design AI systems. You will be instrumental to our success by ensuring the availability, security, and performance of a platform containing hundreds of GPUs. You will have the opportunity to influence technical decisions and to work on a large variety of systems and hardware types in a cutting-edge environment. Your day-to-day will be spent between project-type work and operational management, with a security-focused mindset.

Requirements

  • University degree in a relevant discipline or equivalent experience
  • 8+ years of experience in systems administration, or demonstrated mastery of complex, large-scale Linux environments
  • Deep knowledge of Linux operation, configuration, and optimization
  • Strong network skills, including firewalls, routing protocols, switching, and technologies, such as Infiniband and Ethernet
  • Experience managing storage arrays, and network and distributed file systems, such as Lustre and NFS
  • Customer-service mindset, able to understand and communicate with a variety of stakeholders
  • Capacity to communicate technical concepts to a non-technical audience
  • Fluency in written and spoken English, mandatory.
  • Excellent priority management skills

Nice To Haves

  • Hands-on experience with distributed systems, such as Slurm or Kubernetes, an asset
  • Prior experience administering High-Performance Computing (HPC) clusters, an asset
  • French, a strong asset

Responsibilities

  • Take part in the operations and day-to-day management of LawZero’s various platforms and environments
  • Support internal users with their compute requirements
  • Implement automation, using Infrastructure-as-Code (IaC) tools such as Ansible, Terraform, and shell scripts
  • Ensure systems, network, and storage equipment are kept up-to-date and secure
  • Investigate hardware and software issues, and work with vendors for support and RMA requests
  • Respond to monitoring alerts and support requests in a timely manner
  • Propose and implement improvements to the platform and our processes
  • Maintain and evolve monitoring and alerting systems for performance and reliability
  • Develop and enforce security hardening measures, ensuring compliance and implementing best practices across all compute infrastructure
  • Document architecture and write standard operating procedures

Benefits

  • Comprehensive health benefits
  • A minimum of 20 days vacation per year upon start
  • A minimum retirement savings employer contribution of 4%
  • Generous flexible benefits designed to contribute to your well-being
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service