Data Platform Operations Engineer

ScotiabankToronto, ON
Onsite

About The Position

The Data Platform Operations Engineer will play a critical role within the Enterprise Data & AI Technology organization - one of Scotiabank’s most significant enterprise-wide strategic initiatives. This organization drives data enabled decision making, AI innovation, and technology modernization across the Bank. The Data Platform Operations Engineer works under the guidance of senior engineers and platform leads to help maintain site reliability for our Data & AI platforms. This role focuses on monitoring alerts and dashboards, completing routine operational and maintenance tasks using predefined SOPs/runbooks, triaging and escalating incidents, and providing after-hours on-call support on a rotational basis. You will partner with teams such as IAM, Network, Cloud Ops, Security, and client delivery teams to resolve issues and keep the platform stable, secure, and available.

Requirements

  • 2 years of experience in IT operations, production support, or a similar support role.
  • Foundational knowledge of cloud concepts (Azure preferred): identity/access basics, networking fundamentals, and how to navigate cloud portals and logs.
  • Comfort monitoring alerts/dashboards and troubleshooting using logs and metrics (Azure Monitor/Log Analytics or similar tools).
  • Basic scripting ability (Python, Bash, or PowerShell) to run operational checks or automate simple repetitive tasks.
  • Familiarity with ITIL-style incident and change processes (ticketing, triage, documentation, and handoffs) is an asset.
  • Strong communication and customer support mindset: able to provide clear updates, follow SOPs precisely, and escalate effectively.
  • Willingness to participate in after-hours/on-call rotations as required.

Nice To Haves

  • Exposure to data platforms (Databricks, Spark, SQL warehouses, or similar) is an asset; ability to learn quickly is essential.
  • Degree/college diploma in Computer Science, Engineering, Information Technology, or a related field is preferred (or equivalent practical experience).

Responsibilities

  • Monitor dashboards and alerts (Azure Monitor/Log Analytics, Databricks, and platform tooling), validate signal vs. noise, and take first-response actions according to SOPs.
  • Triage incidents by collecting logs/metrics, identifying likely impact, documenting findings, executing approved remediation steps, and escalating to on-call leads or SMEs with clear context.
  • Perform routine maintenance tasks from predefined runbooks (e.g., operational checks, certificate/secret rotations as directed, housekeeping activities, basic platform validation, scheduled jobs health checks).
  • Work intake from service queues, follow standard procedures for common requests (access requests, connectivity validation, workspace onboarding steps), and ensure requests are completed and communicated within agreed SLAs.
  • Provide timely and accurate updates during incidents and maintenance activities, including status, next steps, and handoffs, using established communication channels and templates.
  • Identify recurring issues and operational pain points, suggest improvements to alerts/runbooks, and contribute to post-incident actions (e.g., updating SOPs, adding monitoring coverage).
  • Maintain clear operational documentation, ensure runbooks are current, and capture lessons learned to improve onboarding and reduce time-to-resolve.
  • Participate in after-hours/on-call rotations and perform approved response actions, escalating when required to meet service reliability targets.

Benefits

  • Upskilling through online courses, cross-functional development opportunities, and tuition assistance.
  • Competitive Rewards program including bonus, flexible vacation, personal, sick days and benefits will start on day one.
  • Free tea & coffee, universal washrooms, and lots of space for team collaboration.
  • Opportunities for community engagement & belonging with our various programs.
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service