Threat Detection and Response Analyst

Saige Partners, San Jose, CA
Onsite

About The Position

Join us at Saige Partners, where we aim to shape your future and be the solution that propels your career forward! This is a contract role for a Security Detection and Response Lead, onsite in San Jose, CA. It is a hands-on systems role for an engineer comfortable working across boundaries often split between kernel and cloud platform teams. The work spans the full stack: from host memory and device behavior in Linux, to node agents and container runtime interactions, to Kubernetes resource orchestration, controllers, and cluster-level lifecycle management.
The position will architect and implement end-to-end systems software for advanced platform and memory technologies across Linux and Kubernetes environments. Solutions will span Linux kernel and host OS integration; user-space system services and node agents; container runtime / kubelet interaction points; and Kubernetes APIs, controllers, CRDs, and orchestration workflows. The role will develop software for resource discovery, reservation, activation, resize, release, rollback, and reconciliation in clustered environments. It also involves building and maintaining Kubernetes operators, controllers, and automation services using Go, client-go, controller-runtime, and related frameworks.
Collaboration with Linux, firmware, and hardware teams is key to translating low-level platform capabilities into safe, observable, Kubernetes-manageable workflows. Work will focus on Linux-side integration for memory lifecycle, hotplug behavior, NUMA awareness, cgroups, device/resource management, and system telemetry. The role also defines interfaces between host software and cluster control-plane components, including state models, failure handling, and recovery behavior. It requires developing node-local software and control-plane services that coordinate host state, platform services, and Kubernetes objects, and driving architecture for reconciliation, idempotency, concurrency control, and fault recovery across distributed components.
Creating observability for controller behavior, node readiness, resource lifecycle progress, and failure analysis is essential. Other key responsibilities are partnering with internal teams across kernel, systems architecture, firmware, validation, and platform software to bring proof-of-concept software into robust internal infrastructure, and supporting bring-up, debugging, validation, and performance tuning in lab and pre-production environments. Join the team to help shape the future of enterprise storage technology, with work impacting global performance-driven solutions for demanding applications.

Requirements

  • Comfort with hands-on systems work across boundaries that are often split between kernel and cloud platform teams.
  • Experience with Go, client-go, controller-runtime, and related frameworks.
  • Familiarity with Linux kernel and host OS integration.
  • Understanding of user-space system services and node agents.
  • Knowledge of container runtime / kubelet interaction points.
  • Experience with Kubernetes APIs, controllers, CRDs, and orchestration workflows.
  • Experience with resource discovery, reservation, activation, resize, release, rollback, and reconciliation in clustered environments.
  • Ability to collaborate with Linux, firmware, and hardware teams.
  • Knowledge of Linux-side integration for memory lifecycle, hotplug behavior, NUMA awareness, cgroups, device/resource management, and system telemetry.
  • Ability to define interfaces between host software and cluster control-plane components, including state models, failure handling, and recovery behavior.
  • Experience developing node-local software and control-plane services that coordinate host state, platform services, and Kubernetes objects.
  • Experience driving architecture for reconciliation, idempotency, concurrency control, and fault recovery across distributed components.
  • Ability to create observability for controller behavior, node readiness, resource lifecycle progress, and failure analysis.
  • Experience partnering with internal teams across kernel, systems architecture, firmware, validation, and platform software.
  • Experience supporting bring-up, debugging, validation, and performance tuning in lab and pre-production environments.

Responsibilities

  • Work across the full stack: from host memory and device behavior in Linux, to node agents and container runtime interactions, to Kubernetes resource orchestration, controllers, and cluster-level lifecycle management.
  • Architect and implement end-to-end systems software for advanced platform and memory technologies across Linux and Kubernetes environments.
  • Design solutions that span Linux kernel and host OS integration; user-space system services and node agents; container runtime / kubelet interaction points; and Kubernetes APIs, controllers, CRDs, and orchestration workflows.
  • Develop software for resource discovery, reservation, activation, resize, release, rollback, and reconciliation in clustered environments.
  • Build and maintain Kubernetes operators, controllers, and automation services using Go, client-go, controller-runtime, and related frameworks.
  • Collaborate with Linux, firmware, and hardware teams to translate low-level platform capabilities into safe, observable, Kubernetes-manageable workflows.
  • Work on Linux-side integration for areas such as memory lifecycle, hotplug behavior, NUMA awareness, cgroups, device/resource management, and system telemetry.
  • Define interfaces between host software and cluster control-plane components, including state models, failure handling, and recovery behavior.
  • Develop node-local software and control-plane services that coordinate host state, platform services, and Kubernetes objects.
  • Drive architecture for reconciliation, idempotency, concurrency control, and fault recovery across distributed components.
  • Create observability for controller behavior, node readiness, resource lifecycle progress, and failure analysis.
  • Partner with internal teams across kernel, systems architecture, firmware, validation, and platform software to bring proof-of-concept software into robust internal infrastructure.
  • Support bring-up, debugging, validation, and performance tuning in lab and pre-production environments.

Benefits

  • Comprehensive benefits package
  • Convenient weekly payment solutions
  • Support for health and a positive work-life balance