About The Position

NVIDIA is defining the next era of computing by tapping into the unlimited potential of AI, an era where our GPU acts as the brains of computers, robots, and self-driving cars. Joining the OpenShell team offers a unique opportunity to work on a highly advanced platform that enables this future. This core system provides secure, sandboxed runtimes essential for autonomous AI agents. The OpenShell platform is sophisticated, incorporating a control-plane gateway, a privacy-conscious inference router, declarative policy enforcement, and specialized container and VM-based sandbox execution environments. This is a chance to make a lasting impact on the world alongside some of the most forward-thinking and hardworking people on the planet.

Requirements

  • Minimum of a Bachelor's degree in Computer Science, Electrical Engineering, or a related technical field, or equivalent experience.
  • 8+ years of meaningful experience.
  • Proficiency in systems programming, including building and debugging long-running services, async runtimes, and handling OS-level integration.
  • Deep knowledge of distributed systems/control planes, including reasoning about state divergence, building reconciliation loops, and designing crash recovery paths.
  • Experience with Container/Sandbox Internals, managing isolated workloads, process lifecycle, capabilities, and network namespaces.
  • Familiarity with gRPC and Protobuf, including crafting machine-to-machine APIs with clean streaming semantics and version safety.
  • Experience operating and extending workloads on Kubernetes, including working with compute drivers, image management, and detailed debugging.
  • Ability to secure inter-service communication using mTLS, gateway registration flows, and non-browser identity verification.
  • Proficiency in instrumenting systems with structured logging, health checks, and distributed tracing for production observability.

Nice To Haves

  • Familiarity with virtualization technologies and alternative runtimes, such as microVMs (e.g., libkrun).
  • Experience improving operator experience through CLI/TUI development, status reporting, and clear error messages.
  • Comfort working at cross-language boundaries, specifically between Rust, Python, protobuf codegen, and shell scripting.

Responsibilities

  • Work across the full stack of a distributed systems platform, from crafting gRPC contracts to building secure sandbox runtimes.
  • Implement and harden network security features, including policy enforcement, L4/L7 proxies, and secure inter-service communication using mTLS.
  • Develop core platform components such as inference routing, ensuring model provider adapters, credential management, and protocol translation integrate seamlessly with the sandbox and gateway.
  • Build reliable configuration and control plane systems that handle state divergence, implement reconciliation loops, and support safe merging and hot-reloading policies.
  • Own the operability experience by creating effective CLI tools, managing release automation, and instrumenting all systems for observability with structured logging and distributed tracing.

Benefits

  • equity
  • benefits
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service