ServiceNow-posted 4 months ago
$187,600 - $328,300/Yr
Full-time • Senior
Pleasanton, CA
Professional, Scientific, and Technical Services

Join the Global Cloud Services organization as the founding member of our Cloud Analytics & FinOps Engineering Platform team. You will be instrumental in establishing the technical foundation and architectural direction for ServiceNow's next-generation FinOps governance platform. We are building a modern, secure, and highly scalable multi-cloud data platform infrastructure powering next-generation analytics to support ServiceNow's Cloud and AI growth. As our Senior Staff DevOps Engineer for Cloud Analytics & FinOps Engineering Platform, you will architect, secure, and operationalize our hybrid cloud data platform infrastructure spanning AWS, GCP, Azure, and on-premises systems. You will have ownership over CI/CD pipelines, infrastructure-as-code, platform security, cost optimization, observability, and data source integrations across our complex ecosystem while navigating ServiceNow's enterprise infrastructure standards and compliance requirements. This is a unique opportunity to build enterprise-grade platform infrastructure from the ground up, establish DevOps best practices for modern data platforms, and work within a Fortune 500 enterprise environment with global scale requirements.

  • Design and implement secure, scalable Kubernetes clusters across AWS EKS, GCP GKE, and Azure AKS supporting complex data platform workloads.
  • Architect hybrid cloud infrastructure with unified management and governance, building infrastructure-as-code solutions using Terraform, AWS CDK, and CloudFormation for repeatable deployments.
  • Establish multi-cloud networking including VPC design, cross-cloud connectivity, Transit Gateway configurations, and secure service mesh implementations while navigating ServiceNow enterprise standards and approval processes.
  • Implement comprehensive security frameworks across multi-cloud data platform stack adhering to enterprise security standards.
  • Design identity and access management across cloud providers following principle of least privilege, orchestrate secrets management using cloud-native solutions, and establish security scanning for container images and infrastructure.
  • Ensure compliance with SOC2, FedRAMP, and regulatory requirements while working with security teams to implement platform controls and data governance.
  • Design sophisticated CI/CD pipelines using Jenkins, GitHub Actions, TeamCity, and Argo CD for GitOps workflows.
  • Manage artifact repositories with automated image scanning and promotion, create Helm charts for complex data platform services (Trino, Airflow, Lightdash, Grafana), and establish automated testing pipelines for infrastructure changes with drift detection and remediation.
  • Architect comprehensive monitoring using Grafana, Prometheus, and CloudWatch with advanced alerting and incident response frameworks.
  • Design SLIs/SLOs/SLAs for data platform services with error budget management, establish SRE practices including toil reduction and capacity planning, and create operational dashboards for platform health and performance metrics.
  • Implement automated remediation workflows and capacity forecasting with predictive analytics.
  • Design secure data ingestion pipelines from disparate systems across multi-cloud and on-premises environments.
  • Implement data source connectors for billing systems, ServiceNow internal systems, SaaS platforms, and ML platforms.
  • Manage hybrid cloud connectivity and orchestrate complex data workflows using Apache Airflow with high availability across multiple cloud environments.
  • Implement automated scaling and resource management across cloud providers.
  • Establish Cloud Development Environment (CDE) platform using Coder to provision on-demand development workspaces via Terraform templates for global distributed teams, with enterprise compliance and cost optimization.
  • Work within ServiceNow enterprise processes for technology approvals and infrastructure changes.
  • Mentor junior engineers across global time zones on SRE best practices, establish operational runbooks for 24/7 platform support with automated incident response, and implement SRE organizational practices including error budget policies and reliability reviews.
  • Experience in leveraging or critically thinking about how to integrate AI into work processes, decision-making, or problem-solving.
  • 10+ years of DevOps/Platform engineering experience with large-scale distributed systems in enterprise environments.
  • Expert-level Kubernetes knowledge across multiple cloud providers (EKS, GKE, AKS) including service mesh and cluster management.
  • Multi-cloud expertise across AWS, GCP, and Azure with deep understanding of platform strengths and cost models.
  • Advanced Infrastructure-as-Code experience with Terraform, CloudFormation, and AWS CDK.
  • Proven CI/CD pipeline management using GitHub Actions, Jenkins, Argo CD, and GitOps workflows in enterprise environments.
  • Strong security background with cloud security best practices and compliance frameworks (SOC2, FedRAMP).
  • Expertise in network security for cloud and Kubernetes environments, including VPC design, zero-trust networking, security policies, firewall rules, VPNs, and intrusion detection/prevention systems.
  • Enterprise navigation skills with large organization processes and cross-team collaboration.
  • Bachelor's degree in Computer Science, Engineering, or related technical field.
  • Full professional proficiency in English.
  • Data engineering background with modern data stack technologies.
  • Service mesh experience with Istio, Linkerd, or cloud-native solutions.
  • Enterprise platform experience at Fortune 500 companies.
  • Global team leadership across multiple time zones.
  • SRE certification or formal training from Google, AWS, or similar programs.
  • Chaos engineering experience with tools like Chaos Monkey, Litmus, or Gremlin.
  • Open-source contributions to DevOps, Kubernetes, or SRE tools.
  • Multi-cloud certifications (AWS Solutions Architect, Google Cloud Architect, Azure Architect, CKA/CKAD, Terraform Associate).
  • Performance engineering experience with large-scale distributed systems.
  • Base pay of $187,600 - $328,300, plus equity (when applicable), variable/incentive compensation and benefits.
  • Health plans, including flexible spending accounts.
  • 401(k) Plan with company match.
  • ESPP, matching donations.
  • Flexible time away plan and family leave programs.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service