DevOps Release Manager

Arize AI
Hybrid

About The Position

AI is rapidly transforming the world. As generative AI reshapes industries, teams need powerful ways to monitor, troubleshoot, and optimize their AI systems. That’s where we come in. Arize AI is the leading AI & Agent Engineering observability and evaluation platform, empowering AI engineers to ship high-performing, reliable agents and applications. From first prototype to production scale, Arize AX unifies build, test, and run in a single workspace—so teams can ship faster with confidence. We’re a Series C company backed by top-tier investors, with over $135M in funding and a rapidly growing customer base of 150+ leading enterprises and Fortune 500 companies. Customers like Booking.com, Uber, Siemens, and PepsiCo leverage Arize to deliver AI that works. Our On-Prem Engineering team owns the deployment of Arize within customer-managed environments. Beyond partnering with customers to define infrastructure requirements, the team designs and builds the systems that enable Arize to run seamlessly across a wide range of cloud and on-premise infrastructures. Over time, the team has developed deep expertise in packaging and delivering features from our SaaS platform into reliable, production-ready releases for self-hosted customers. It’s a small, highly dynamic group that operates with a high degree of autonomy, ownership, and initiative. In this role, you’ll focus on building, packaging, and delivering the Arize stack using a mix of technologies including Bazel, Go, Java, and Python. This is a unique opportunity to work on foundational infrastructure and play a key role in shaping how organizations observe, monitor, and evaluate AI systems in their own environments.

Requirements

  • 5+ years of experience working with high-performance backend systems.
  • Enthusiasm and interest in the AI and LLM ecosystem, with a desire to learn and stay updated on emerging technologies.
  • Previous work building and operating highly complex platforms/systems.
  • Knowledge of working with public clouds & container orchestration - AWS, GCP, Azure, Kubernetes, etc.
  • Ability to operate with autonomy and ownership in a small, fast-moving team.

Nice To Haves

  • Familiarity with packaging, release engineering, or reproducible build systems (e.g., Bazel) is a strong plus.

Responsibilities

  • Design, build, and maintain scalable backend systems that power the deployment of Arize in customer-managed (on-prem and cloud) environments.
  • Develop tooling and infrastructure to package, test, and deliver the Arize platform as reliable, production-ready self-hosted releases.
  • Work across the stack using Go, Java, Python, and Bazel to build reproducible builds and deployment pipelines.
  • Partner with customers to understand infrastructure constraints and translate them into robust deployment architectures.
  • Build and optimize services that support high-volume analytics workloads in resource-constrained or isolated environments.
  • Improve system reliability, observability, and upgradeability for distributed deployments.

Benefits

  • medical
  • dental
  • vision
  • 401(k) plan
  • unlimited paid time off
  • generous parental leave plan
  • others for mental and wellness support
  • WFH monthly stipend to pay for co-working spaces
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service