Staff Site Reliability Engineer

TransUnionGreenwood Village, CO
Hybrid

About The Position

At TransUnion, this role will report to a DevOps Director. The Site Reliability Engineering team drives reliability strategy, elevates engineering standards, and owns some of the most complex and consequential work on the platform. This is a hybrid position and involves regular performance of job responsibilities virtually as well as in-person at an assigned TU office location for a minimum of two days a week. As a Staff Site Reliability Engineer at TransUnion, you will serve as a senior technical leader and force multiplier on the SRE team. Operating with full autonomy, you will drive reliability strategy, lead high-risk technical initiatives, and set the engineering standards that elevate the entire team. You’ll bring deep expertise across GCP, Kubernetes, CI/CD pipelines, and monitoring platforms — contributing to strategic decisions on major platform components while fully participating in on-call rotation. Whether stepping in to lead the team, owning complex capacity and security work, or anchoring incident response with calm and maturity, your impact will be felt across the platform and the people around you.

Requirements

  • 5+ years of experience in Cloud Architecture, Site Reliability Engineering, Platform Engineering, or related fields — with a proven track record of designing and delivering at enterprise scale.
  • Architectural authority — you don’t just contribute to technical decisions, you drive them. You’ve owned the design of large-scale, mission-critical systems from whiteboard to production.
  • Deep, hands-on expertise with Google Cloud Platform (GCP) and Kubernetes (K8s) — running high-volume, high-availability workloads with 99.999% reliability targets.
  • Mastery of CI/CD pipeline architecture — designing end-to-end delivery systems that are fast, safe, and built for scale.
  • Expert-level command of monitoring, observability, and alerting platforms (e.g., Datadog, Prometheus, Grafana, PagerDuty) — you define what good looks like.
  • Deep Linux expertise — from kernel internals and system performance tuning to hardening and troubleshooting at the OS level in production environments.
  • Strong command of database architecture — including relational (PostgreSQL, MySQL, Cloud SQL) and NoSQL (Bigtable, Firestore, Redis) systems, with experience designing for high availability, replication, failover, and performance at scale.
  • Advanced networking knowledge — including VPCs, subnets, DNS, load balancing, firewall rules, VPNs, private service connect, and hybrid connectivity patterns across cloud and on-prem environments.
  • Proven expertise in Infrastructure-as-Code (IaC) — designing and enforcing scalable, reusable frameworks using Terraform, Pulumi, or equivalent tools.
  • Strong proficiency in scripting and automation (e.g., Python, Bash, Go) — building the tools and workflows that eliminate toil and accelerate delivery.
  • Deep understanding of security architecture — including identity and access management (IAM), zero-trust principles, secrets management, encryption, and compliance frameworks (SOC 2, PCI-DSS, etc.).
  • Hands-on experience designing and integrating AI/ML-powered solutions into cloud-native platforms — including familiarity with LLM orchestration, vector databases, model serving infrastructure, and AI observability — with the ability to evaluate emerging tools and translate them into reliable, production-grade capabilities.
  • Proven ability to set and enforce architectural standards across multiple teams, driving consistency, automation, and operational maturity organization-wide.
  • A strategic thinker who can translate complex business requirements into resilient, scalable, and cost-efficient cloud-native solutions.
  • Exceptional communication skills — equally fluent presenting to engineers in the weeds and executives in the boardroom.
  • A calm, decisive presence in high-pressure situations — the person others look to when the stakes are highest.

Responsibilities

  • Technical Leadership & Strategic Influence Recognized expert across multiple systems; actively contributes to architectural and strategic decisions around major platform components. Leads research, testing, implementation, and continuous improvement for new systems and tooling. Performs complex, high-impact work including capacity planning, load testing, and security improvements.
  • Operational Excellence & On-Call Fully participates in the team’s on-call rotation; models calm, effective, and blameless incident response. Serves as a significant technical contributor during major incidents and problem resolution. Plans and leads high-risk maintenance events with minimal to no customer impact.
  • Standards & Team Elevation Elevates team standards through new tooling, processes, procedures, and effective communication. Capable of stepping in to lead and represent the team — a trusted resource during transitions or coverage gaps. Sets new professional benchmarks in technical quality, engineering culture, and cross-functional collaboration.

Benefits

  • Enjoy day-one eligibility for medical, dental, and vision coverage, plus supplemental plan options. Spousal, domestic partner, and other eligible dependent coverage is available on select plans. Choose tax‑advantaged HSA and FSA accounts to make everyday care more affordable.
  • We’ve got your back with company‑paid basic life and AD&D, optional voluntary life and AD&D for you and your family, and short‑ and long‑term disability. You can also opt into a legal plan, pet insurance, and travel accident coverage.
  • From adoption assistance and fertility planning coverage to caregiver support, we’re here for every chapter. Access Dependent Care FSA for possibility of an employer match, a complimentary Care@Work membership, and up to 12 weeks of paid parental leave with eligibility for a thoughtful, gradual return.
  • Build toward what’s next with our 401(k) with employer match and Employee Stock Purchase Plan (ESPP). Tap financial wellness resources, career coaching, and optional long‑term care insurance to plan confidently.
  • Grow and recharge with tuition reimbursement, flexible time off for exempt employees or paid time off for nonexempt employees, up to 12 paid holidays per year, commuter benefits, employee discounts, charitable gift matching, and paid volunteer time off, plus corporate volunteer events that make it easy to give back.
  • Access 24/7 support including professional therapy, coaching, and emotional well‑being programs alongside guided meditation and resources that support physical, mental, social, and financial wellness.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Education Level

No Education Listed

Number of Employees

5,001-10,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service