About The Position

Database Reliability Engineer & Administrator (PostgreSQL & ClickHouse) TCN is looking for a Database Reliability Engineer (DBRE) to join our team in Saint George, Utah. In this role, you will be the guardian of our data layer, ensuring that TCN’s global production databases are performant, scalable, and resilient. While you will share the core DNA of a Site Reliability Engineer, your specific mission is to optimize and manage our high-traffic PostgreSQL environments and our high-performance ClickHouse analytical clusters. You will bridge the gap between application development and database operations, ensuring our data infrastructure keeps pace with our global growth. About TCN TCN is a fast-growing technology company and provides all its services over the internet in a cloud-based software-as-a-service model. TCN's technology stack and culture are positive and forward-thinking. When you join TCN, you are joining a dedicated team of professionals. Employees often describe our culture as friendly, collaborative, flexible, and fast-paced. To learn more, visit our website.

Requirements

  • Education: Bachelor’s degree in Computer Science, Information Technology, or a related field.
  • Experience: 3+ years in a Linux environment as a Database Administrator (DBA), DBRE, or SRE with a heavy focus on data systems.
  • Database Expertise: PostgreSQL: Advanced knowledge of internal mechanics, replication (physical/logical), and extension management.
  • ClickHouse: Experience managing OLAP workloads, understanding of MergeTree engines, and distributed table configurations.
  • Cloud & Containers: Demonstrated experience with Google Cloud Platform (GCP) and running stateful workloads inside Kubernetes .
  • Linux Mastery: Deep understanding of the Linux OS, specifically how kernel parameters, storage I/O, and networking impact database performance.
  • Automation: Proficient in scripting (Bash, Python) and configuration management.
  • Networking: Knowledge of TCP/IP, TLS encryption for data in transit, and load balancing (L4/L7).
  • Soft Skills: Excellent communication skills; the ability to explain "why" a query is slow to a developer and "what" the business impact is to a stakeholder.

Nice To Haves

  • Familiarity with Go is a significant plus.

Responsibilities

  • Database Architecture & Deployment: Collaborate with developers to design schema migrations and deploy database changes that maintain high availability.
  • Assist in the architectural design of PostgreSQL and ClickHouse clusters to ensure they meet scaling requirements.
  • Performance Tuning & Optimization: Proactively monitor and tune PostgreSQL (query optimization, indexing strategies, vacuuming) and ClickHouse (merge tree optimizations, shard/replica management) to ensure sub-second latency for our clients.
  • Infrastructure as Code: Automate the provisioning and configuration of database clusters using tools like Terraform, Ansible, or Kubernetes Operators.
  • Resilience & Failure Management: Manage high-availability (HA) solutions (e.g., Patroni, PGBouncer) and ensure robust backup/recovery strategies are tested and functional.
  • Deep Troubleshooting: Lead root-cause analysis for complex database incidents. Debug locking issues, replication lag, and resource contention in a cloud-native environment.
  • Observability: Build and maintain dashboards and alerting for database health, focusing on SLIs/SLOs related to data consistency and query performance.
  • Incident Response: Participate in a 24/7 on-call rotation, serving as the subject matter expert for database-related outages.

Benefits

  • Medical Insurance (HDHP with HSA)
  • Dental Insurance
  • Vision Insurance
  • Life Insurance
  • 401k with employer match
  • Competitive salary
  • Paid time off
  • Paid holidays (11 scheduled)
  • Weekly lunches; free drinks and snacks
  • Casual dress and flexible work environment
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service