Senior Engineer, Cloud Operations

Bed Bath & BeyondBrampton, ON
CA$105,000 - CA$130,000

About The Position

Sleep Country Canada is looking for a Senior Cloud Engineer to play a key role in ensuring the reliability, performance, and security of Sleep Country’s multi-cloud environment. This position combines hands-on engineering with strategic problem-solving to design, build, and optimize cloud-based systems, while driving continuous improvement and innovation across cloud operations.

Requirements

  • Bachelor’s degree in computer science, Information Systems or a related field.
  • Approximately 5+ years of progressive experience in IT infrastructure or cloud engineering, with a focus on deploying and managing cloud-based environments.
  • Demonstrated success in supporting complex, distributed systems and services in a production (24/7) environment.
  • Deep hands-on knowledge of public cloud platforms (such as AWS, Microsoft Azure, or Google Cloud Platform), including experience with compute, storage, networking, and managed services.
  • Proficiency in designing highly available, scalable cloud architectures and implementing cloud networking concepts (VPC/VNet configuration, security groups, load balancers, etc.).
  • Strong understanding of virtualization and containerization technologies.
  • Strong experience with infrastructure automation and DevOps practices.
  • Proficiency in writing scripts (e.g., PowerShell, Python, Bash) to automate system tasks and deployments.
  • Experience building and maintaining CI/CD pipelines (using tools such as Jenkins, GitHub Actions, or Azure DevOps) to automate build, test, and deployment processes.
  • Proven ability in monitoring, incident management, and performance tuning for cloud-based systems.
  • Experience using modern observability tools (such as Dynatrace, Datadog, CloudWatch, etc.) to collect metrics, logs, and traces, and to troubleshoot complex issues.
  • Strong analytical and problem-solving skills, with the capacity to perform root cause analysis and implement effective fixes under pressure.
  • Comfortable working in an on-call rotation, and capable of making sound decisions quickly during critical incidents to restore service.
  • Excellent communication skills, both written and verbal, with the ability to document procedures and clearly convey technical information to teammates and stakeholders.
  • Demonstrated curiosity and adaptability in keeping pace with evolving technologies and practices in cloud computing, including willingness and aptitude to learn and utilize new tools (including AI-driven operations tools, automated monitoring/alerting systems) to improve efficiency and reliability.

Nice To Haves

  • Experience with container orchestrators (e.g., Kubernetes) is an asset.
  • Experience with BigCommerce, Shopify, or similar cloud-based eCommerce platforms.
  • Familiarity with Oracle Cloud Infrastructure (OCI) and/or Oracle Fusion Cloud applications in an operational setting considered an asset.

Responsibilities

  • Design, build, and maintain cloud infrastructure across multiple platforms (e.g., AWS, Azure, GCP) to support business applications and services.
  • Implement configurations for compute, storage, networking, and cloud services following best practices for scalability and resilience, performing regular maintenance, patching, and environment updates to ensure systems remain current and performant.
  • Monitor cloud systems and services using enterprise monitoring and alerting tools to ensure uptime and performance, Investigating and troubleshooting incidents and problems, performing root-cause analysis and timely remediation of cloud infrastructure issues.
  • Participate in a rotating on-call schedule for after-hours support, responding to critical incidents to restore services and minimize downtime in line with service level objectives.
  • Develop and manage automation scripts and CI/CD pipelines to streamline cloud deployments and operations, Utilizing infrastructure-as-code (e.g., Terraform, CloudFormation) to provision and configure cloud resources reliably and repeatedly.
  • Implement automated build, test, and deployment processes in collaboration with DevOps and development teams, reducing manual effort and improving consistency across environments.
  • Analyze system performance metrics and identifies opportunities to improve efficiency, reliability, and cost-effectiveness of cloud operations, optimize resource usage and application performance through tuning and capacity planning.
  • Evaluate and adopt new tools and practices (including emerging AIOps platforms, intelligent monitoring, and auto-remediation technologies) to enhance operational capabilities.
  • Contribute to improving operational playbooks, runbooks, and knowledge bases for the cloud operations function.
  • Implement and adhere to security and compliance controls within cloud environments, following corporate IT security policies, standards, and regulatory requirements (e.g., data protection, privacy) in all cloud configuration and deployment activities.
  • Work closely with Security and Compliance teams to remediate vulnerabilities, manage cloud access and encryption keys, and ensure that cloud infrastructure and processes pass audits and meet governance standards.
  • Maintain accurate documentation for configurations and changes as part of compliance and change management processes.
  • Excellent communication skills, both written and verbal, with the ability to document procedures and clearly convey technical information to teammates and stakeholders, effectively collaborating with cross-functional teams (Cloud Ops, DevOps, Developers, Security, Vendors) to implement solutions and resolve issues.

Benefits

  • Opportunities for growth and advancement
  • Diverse and inclusive work environment
  • Extensive training, mentoring and continuous development
  • Access to training and development platforms
  • Associate Discount Program
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service