Infrastructure Engineer - Contract - Hybrid

PelmorexOakville, ON
Hybrid

About The Position

The Infrastructure Engineer - Meteorological Development role combines infrastructure engineering, platform reliability, and software development to deliver highly available, scalable, and high-performance systems to support 4 billion global locations with high quality data. Working in a modern cloud-native environment, you will design and operate distributed systems on Kubernetes (AWS EKS & GCP GKE) with Istio service mesh and AWS ECS using Docker containers, infrastructure as code (Terraform & Terragrunt), and GitOps/CI/CD pipelines (on Jenkins). You will work across AWS and/or GCP services, manage application delivery with Helm, and support stateful and caching layers such as Redis (AWS Elasticache). You will ensure system reliability through Linux-based operations and observability tooling including Prometheus, Grafana, Kibana, ElasticSearch, Jaeger and Kiali. Implement and maintain playbook and runbooks using bash scripts and Ansible. Working closely with technical leads, developers, operations teams, and infrastructure administrators, this role helps modernize and standardize infrastructure through automation, observability, and cloud/platform engineering best practices. The Infrastructure Engineer plays a key role in ensuring platform stability, scalability, security, and operational excellence across critical meteorological systems and data pipelines. Provide support to Pelmorex’s meteorological infrastructure and platforms that powers The Weather Network, MeteoMedia, ElTiempo.es, and subsidiaries.

Requirements

  • Degree or diploma in Computer Science, Engineering, Mathematics, or equivalent practical experience.
  • Minimum 3+ years of experience designing, implementing, and managing containerized environments using Docker and Kubernetes across platforms such as CloudStack, Proxmox, EKS & GKE.
  • Strong knowledge of cloud platforms and infrastructure technologies including AWS, GCP, CloudStack, and Proxmox.
  • Experience with core networking concepts including TCP/IP, NAT, DNS, load balancing, and firewalls.
  • Strong experience building and supporting production-grade distributed systems in Linux environments.
  • Experience developing production applications in one or more of the following languages: Go, Python, C/C++, or Rust.
  • Strong scripting and infrastructure automation experience using technologies such as Python, Bash, Terraform, Helm, Terragrunt, Groovy, or JavaScript.
  • Understanding of data structures, algorithms, and performance optimization concepts.
  • Experience designing and supporting highly available, scalable systems handling large volumes of data.
  • Experience with relational and NoSQL databases, including caching technologies such as Redis or AWS ElastiCache.
  • Experience with CI/CD practices and tools such as Jenkins or similar platforms.
  • Experience building and consuming REST-based APIs and services.
  • Familiarity with security best practices and secure infrastructure design principles.
  • Experience with monitoring and observability tools such as Prometheus, Grafana, ElasticSearch, Kibana, Jaeger and Kiali.
  • Understanding of cloud cost optimization, including balancing performance, reliability, scalability, and operational efficiency.
  • Strong problem-solving and communication skills with the ability to gather requirements, whiteboard solutions, and collaborate effectively across teams.
  • Experienced in maintaining operational runbooks, automation playbooks, and reliable cloud infrastructure procedures for production systems.

Nice To Haves

  • Background in meteorology
  • Keen interest in data and data mining, problem solving
  • Experience with spatiotemporal data sets and scientific data formats including netCDF, HDF, GRIB and BUFR would be a plus.
  • Experience in Ansible.
  • Education or experience in applying ML/AI to augment data analysis
  • Experience with AI/ML infrastructure or platforms is considered an asset.

Responsibilities

  • Design and implementation of Meteorological infrastructure core services that are currently running under Kubernetes (AWS EKS) with Istio Service mesh, EC2 instances, ECS Clusters and deployed under Jenkins CI/CD pipeline via Helm, SSM and bash deployment scripts. Core services run with AWS S3, SQS, SNS, FSx, ECS, EC2, Elasticache and ALB/NLB resources.
  • Improve monitoring system running under Prometheus, Grafana and Kibana.
  • Use AI-assisted development tools and workflows to improve productivity, accelerate delivery, enhance code quality, support troubleshooting, and streamline operational tasks while maintaining quality and accountability.
  • Provide technical advice and produce effort estimates on the various work required; execute work based on priorities.
  • Work with cross-functional teams to ensure all systems function as intended and without impediment to existing products or processes. This includes the development and/or integration of monitoring systems.
  • Work with Technology Operations team to ensure end-to-end data monitoring in software design, from data acquisition to delivery.
  • Use knowledge of databases and database technology to create applications which are distributed, multi-tier, and able to service large volumes of data.
  • Provide support to our operational teams for the relevant production systems and ensure services stay available and stable.
  • Maintaining operational runbooks, automation playbooks, and reliable cloud infrastructure procedures for production systems.

Benefits

  • Flexible Work Environment
  • Summer Hours
  • Paid Vacation Days Off (in addition to Personal Days)
  • An IDEAS (Inclusion, Diversity, Equity, Awareness, Solidarity) team
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service