Join us at Saige Partners, where we aim to shape your future and be the solution that propels your career forward! This is a contract role for a Security Detection and Response Lead, onsite in San Jose, CA. The role involves working across the full stack, from host memory and device behavior in Linux to node agents and container runtime interactions, to Kubernetes resource orchestration, controllers, and cluster-level lifecycle management. This is a hands-on systems role for an engineer comfortable working across boundaries often split between kernel and cloud platform teams. The position will architect and implement end-to-end systems software for advanced platform and memory technologies across Linux and Kubernetes environments. Solutions will span Linux kernel and host OS integration, user-space system services and node agents, container runtime/kubelet interaction points, and Kubernetes APIs, controllers, CRDs, and orchestration workflows. The role will develop software for resource discovery, reservation, activation, resize, release, rollback, and reconciliation in clustered environments. It also involves building and maintaining Kubernetes operators, controllers, and automation services using Go, client-go, controller-runtime, and related frameworks. Collaboration with Linux, firmware, and hardware teams is key to translate low-level platform capabilities into safe, observable, Kubernetes-manageable workflows. Work will focus on Linux-side integration for memory lifecycle, hotplug behavior, NUMA awareness, cgroups, device/resource management, and system telemetry. Defining interfaces between host software and cluster control-plane components, including state models, failure handling, and recovery behavior, is also part of the role. Development of node-local software and control-plane services that coordinate host state, platform services, and Kubernetes objects, along with driving architecture for reconciliation, idempotency, concurrency control, and fault recovery across distributed components, will be required. Creating observability for controller behavior, node readiness, resource lifecycle progress, and failure analysis is essential. Partnering with internal teams across kernel, systems architecture, firmware, validation, and platform software to bring proof-of-concept software into robust internal infrastructure, and supporting bring-up, debugging, validation, and performance tuning in lab and pre-production environments are also key responsibilities. Join the team to help shape the future of enterprise storage technology, with work impacting global performance-driven solutions for demanding applications.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Career Level
Senior
Education Level
No Education Listed