Cloud Engineer

ScotiabankToronto, ON
Onsite

About The Position

Combining the disciplines of DevOps, Systems Administration, and Cloud Engineering, the Cloud Operations Engineer contributes to the reliability, scalability, and performance of cloud-based systems. This role focuses on hands-on execution, operational support, and continuous improvement within a dynamic environment. You will work closely with Cloud Engineering and application teams to support system reliability, enhance operational processes, and maintain cloud infrastructure. You will also help implement tools, processes, and monitoring solutions to support system health and enable efficient responses to operational events.

Requirements

  • 5–7 years of experience in Cloud Operations, Systems Administration, or Enterprise Operations
  • Experience working in enterprise environments with distributed systems
  • Strong problem-solving and troubleshooting skills
  • Good communication skills and ability to work collaboratively
  • Hands-on experience with Microsoft Azure and/or Google Cloud Platform (GCP)
  • Experience supporting cloud-based or hybrid environments
  • Familiarity with Kubernetes and containerized workloads
  • Understanding of secure and highly available infrastructure principles
  • Experience with Terraform (IaC), GitHub, and CI/CD pipelines (e.g., GitHub Actions, Jenkins)
  • Basic understanding of GitOps and release processes
  • Scripting experience using PowerShell, Python, and/or Bash
  • Familiarity with configuration management tools such as Ansible
  • Experience with monitoring and observability tools (e.g., Dynatrace or similar)
  • Understanding of incident, problem, and change management processes
  • Working knowledge of cloud architecture and distributed systems concepts
  • Experience working in Agile or Lean teams

Responsibilities

  • Support reliability and operational stability of infrastructure and applications
  • Assist in improving availability, scalability, and performance
  • Troubleshoot and resolve system issues, escalating complex problems when needed
  • Follow and contribute to best practices for resilience and high availability
  • Build, operate, and maintain cloud infrastructure in Azure and GCP environments
  • Implement and maintain monitoring, alerting, and observability solutions
  • Support performance optimization, disaster recovery, and access management activities
  • Contribute to automation initiatives to improve operational efficiency
  • Participate in day-to-day operations including incident, problem, and change management
  • Support incident response and assist in root cause analysis and post-incident reviews
  • Maintain and follow runbooks and operational procedures
  • Collaborate with application and engineering teams to support system reliability
  • Apply cloud and operational best practices in daily work
  • Participate in Agile ceremonies and team discussions
  • Contribute to peer reviews and continuous improvement initiatives

Benefits

  • Upskilling through online courses, cross-functional development opportunities, and tuition assistance.
  • Competitive Rewards program including bonus, flexible vacation, personal, sick days and benefits will start on day one.
  • Free tea & coffee, universal washrooms, and lots of space for team collaboration.
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service