About The Position

The Senior Staff Database Reliability Engineer exists to set technical direction and ensure the reliability of UKG’s SQL Server and PostgreSQL platforms across cloud and hybrid deployments. This role is accountable for defining reliability standards, leading complex technical initiatives, and driving automation that improves platform resilience while reducing operational toil. You will operate as a senior individual contributor and technical authority, influencing architecture decisions, mentoring other engineers, and partnering across teams to ensure database reliability is built into the platform by design.

Requirements

  • Demonstrated ability to design and operate enterprise SQL Server and/or PostgreSQL platforms supporting business‑critical workloads.
  • Demonstrated experience leading incident response and root cause analysis for production relational database systems.
  • Hands‑on experience with automation and scripting for database operations (e.g., PowerShell, Python, or equivalent).
  • Ability to define engineering standards, review designs, and influence technical decisions across multiple teams.

Nice To Haves

  • Experience operating SQL Server and PostgreSQL in cloud or hybrid environments, including IaaS or managed database services.
  • Experience implementing monitoring, alerting, and observability for large‑scale relational database platforms.
  • Experience improving operational maturity through automation, standardization, and reliability engineering practices.
  • Experience partnering with Security or Compliance teams to support data protection, audit, or regulatory requirements.

Responsibilities

  • Design, review, and evolve high‑availability and disaster recovery architectures for SQL Server and PostgreSQL to meet defined availability and resilience targets.
  • Lead incident response and root cause analysis for high‑severity database incidents, identifying systemic issues and implementing long‑term reliability improvements.
  • Define and maintain reliability standards, including SLOs, SLIs, error budgets, and operational readiness requirements for relational database services.
  • Build and maintain automation for database provisioning, configuration, patching, upgrades, backup validation, and disaster recovery testing.
  • Architect and operate SQL Server and PostgreSQL platforms in cloud and hybrid environments, including IaaS‑based and managed database offerings.
  • Implement and improve database observability using metrics, logs, and performance telemetry to enable proactive detection of availability, capacity, and performance issues.
  • Partner with application, SRE, and cloud engineering teams to review designs and ensure database reliability and scalability are incorporated early in the development lifecycle.
  • Establish and maintain runbooks and operational procedures, focusing on repeatability, automation, and reduced human dependency.
  • Mentor and provide technical guidance to DBREs and adjacent engineers to improve overall database platform maturity.

Benefits

  • Flexibility
  • Performance-based bonus plan
  • Restricted stock unit awards
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service