Senior Cloud Infrastructure Engineer

Judgment Labs
San Francisco, CA
Onsite

About The Position

We are Judgment. We build infrastructure for Agent Behavior Monitoring (ABM): surfacing silent behavioral issues, understanding how agents behave in production, and turning interaction data into actionable signals. Hundreds of teams building autonomous agents rely on Judgment to understand how their systems are behaving post-deployment. When something breaks, they’re not stuck in reactive incident triage. They can see which behaviors are trending, which configurations caused regressions, and what to actually fix.

We've raised $30M+ across two rounds in the past five months. Our investors include Lightspeed, SV Angel, Valor Equity Partners, Nova Global, Chris Manning, Michael Ovitz, Michael Abbott, Cory Levy, Kevin Hartz, and others.

The Role: We are looking for a Senior Cloud Infrastructure Engineer to architect and scale the deployment infrastructure that powers agent behavior monitoring at production scale. This role is crucial for enabling enterprise customers to run Judgment in their environments (multi-region cloud, self-hosted, or BYOC deployments) while maintaining the security, compliance, and reliability standards they require. We need someone who has built distributed systems that handle real production traffic and can own infrastructure from architecture through operations.

Requirements

  • Deep expertise across multiple cloud platforms (AWS, GCP, Azure) including compute, networking, storage, and managed services
  • Experience designing and operating multi-region, highly available cloud infrastructure
  • Strong knowledge of cloud networking (VPCs, load balancers, DNS, CDN, service mesh)
  • Expertise in Infrastructure as Code (Terraform, Pulumi, CloudFormation) and GitOps practices
  • Experience with self-hosted deployments and BYOC architectures, including customer environment setup and lifecycle management
  • Ability to design and implement secure network architectures for customer-managed cloud accounts and on-premises environments
  • Experience building automation for provisioning, upgrades, and maintenance in customer-controlled infrastructure
  • Senior-level ownership: you will own the infrastructure roadmap and architecture design, set practices, identify bottlenecks, and ship fixes

Nice To Haves

  • Experience with air-gapped and restricted network environments
  • Knowledge of private connectivity solutions (AWS PrivateLink, Azure Private Link, GCP Private Service Connect)
  • Familiarity with enterprise security requirements including SOC 2, ISO 27001, HIPAA, and FedRAMP compliance frameworks
  • Prior experience as a senior infrastructure engineer at an observability company (Datadog, Sentry, Honeycomb), an enterprise AI startup (Harvey, Glean), or an infrastructure SaaS company (Databricks, Snowflake)

Responsibilities

  • Design and implement multi-region cloud architecture with automatic failover and disaster recovery across AWS, GCP, and Azure
  • Architect and deploy regional compliance solutions (data residency, sovereignty) for enterprise customers in different geographies
  • Design and implement self-hosted and BYOC (Bring Your Own Cloud) deployment architectures for enterprise customers with strict security requirements
  • Build secure VPC peering and private connectivity solutions for customer-managed environments
  • Develop automated provisioning systems for on-premises and hybrid cloud deployments
  • Create customer-facing documentation and deployment guides for self-service infrastructure setup
  • Design enterprise-grade security architectures including network isolation, encryption at rest and in transit, and identity management integration (SSO, SAML, SCIM)
  • Build monitoring and observability solutions for distributed self-hosted deployments with centralized logging and alerting

Benefits

  • Competitive salary and equity, full benefits, chef-cooked meals daily, gym access, and whatever tools or resources you need to do your best work.