About The Position

We are looking for a skilled and passionate Senior Site Reliability Engineer (SRE), based on the East Coast of the United States, to join the Cloud Platform team, which empowers DataSnipper's growth through a secure and scalable enterprise cloud platform. As a Senior SRE at DataSnipper, you will set the strategic direction for our cloud infrastructure on Microsoft Azure. You will define target-state architectures and roadmaps, lead enterprise-scale landing zone design and governance, and partner with product, SRE, security, and data teams to deliver multi-tenant, multi-region, secure-by-default solutions. You'll standardize patterns, automate with Infrastructure as Code, and guide migrations and modernizations, turning best practices into measurable reliability, security, and cost outcomes.

Requirements

  • 7+ years in cloud architecture or platform engineering, with deep hands-on expertise in Microsoft Azure and experience setting cloud strategy and roadmaps
  • Proven track record designing multi-tenant, multi-region SaaS architectures and enterprise-scale Azure Landing Zones with strong governance and policy
  • Expertise across Azure services: AKS/Container Apps, App Service, VMSS; VNet/vWAN, Private Link, Azure Firewall, App Gateway/WAF, Front Door; Entra ID (Azure AD), RBAC, Managed Identity, PIM; Storage, Azure SQL DB; Service Bus/Event Grid; Key Vault; Defender for Cloud; Azure Monitor/Log Analytics/App Insights
  • Strong DevOps/SRE practices: CI/CD (GitHub Actions), GitOps, blue/green and canary deployments, infrastructure testing, and progressive delivery
  • Hands-on with Infrastructure as Code (Terraform and/or Bicep; ARM), policy-as-code, and environment bootstrapping at scale
  • Solid grasp of networking and hybrid connectivity (ExpressRoute, VPN), security-by-design, and zero trust
  • FinOps mindset with demonstrable cost optimization, tagging/chargeback, budgets/alerts, and rightsizing
  • Strong communication and stakeholder management skills; ability to influence across product, SRE, security, and leadership
  • Proficiency in scripting/coding (PowerShell and one of Python/C#/Go)

Nice To Haves

  • Azure Solutions Architect Expert (AZ-305)
  • Azure DevOps Engineer Expert (AZ-400)
  • CKA/CKAD
  • Experience in regulated environments (SOC 2, ISO 27001, HIPAA, GDPR)
  • Contributions to public docs/reference architectures

Responsibilities

  • Define and own the cloud infrastructure strategy, reference architectures, and platform roadmaps for Azure across compute, networking, identity, data, security, and observability
  • Design and implement an enterprise-scale Azure Landing Zone (management groups, subscriptions, RBAC, Azure Policy) and governance for multi-tenant SaaS and regulated customers
  • Architect highly available, multi-region solutions leveraging services such as AKS/Container Apps, App Service, Azure DB for PostgreSQL, Redis, Service Bus/Event Grid, Front Door/Traffic Manager, and CDN
  • Enable secure private connectivity patterns (Private Link, VNet integration, Azure Firewall/WAF, ExpressRoute/VPN) and champion zero-trust principles with Entra ID and Managed Identity
  • Establish platform engineering 'golden paths' and reusable accelerators: Terraform modules, environment bootstrapping, and CI/CD templates in GitHub Actions
  • Drive well-architected reviews for mission-critical workloads; translate findings into actionable improvements for reliability, security, performance, and cost optimization with measurable SLOs/SLIs
  • Implement end-to-end observability using Azure Monitor, Log Analytics, Application Insights, and (where applicable) Prometheus/Grafana; automate proactive detection and post-incident improvement plans
  • Partner with Security to implement least-privilege access, PIM, Defender for Cloud, Key Vault, secret rotation, and compliance controls (e.g., SOC 2, ISO 27001)
  • Define and validate DR/BCP strategies (RTO/RPO), including zone-redundancy, geo-replication, backups, and failover testing
  • Mentor and coach engineering teams; lead architecture reviews, threat modeling, and technical workshops; and author clear documentation and reference architectures
  • Evaluate and guide adoption of new Azure capabilities; collaborate with partners and vendors to enhance our platform

Benefits

  • Excellent salary
  • Flexible paid time off
  • Remote work
  • Comprehensive medical and dental coverage
  • 401(k) match
  • Paid parental leave
  • Stock participation plan
  • The chance to be part of one of the fastest-growing scale-ups in the world
  • The opportunity to make an impact by disrupting the finance industry with us
  • A flexible and growing organization with lots of opportunities to learn and develop
  • International working environment, with a team of friendly and driven colleagues
  • Access to OpenUp and Talkspace, mental health and wellness platforms