Who are we?
Komodor is a cutting-edge Kubernetes platform built by developers, for developers. We help engineering and infrastructure teams manage complex systems with ease, efficiency, and transparency, so they can focus more on innovation and less on firefighting Kubernetes challenges.
Our platform is trusted by thousands of teams worldwide, with standout capabilities like Klaudia, our AI-powered Kubernetes failure detection and analysis engine that delivers real-time insights to dev and infra teams; the Cost team, helping companies dramatically reduce cloud spend; the Health team, building industry-leading troubleshooting features; and our Operations group, crafting powerful Kubernetes-native agents, operators, and controllers.
Your mission
We’re looking for a highly skilled Software Engineer to join the team behind Podmotion: Komodor’s cutting-edge live migration engine for Kubernetes workloads. Your mission is to design, build, and optimize the next generation of pod mobility using CRIU-based checkpoint/restore, enabling stateful containers to move across nodes while preserving their full runtime memory state and more!
In this role, you’ll develop low-level container runtime integrations, extend Kubernetes-native operators, and contribute to the open-source ecosystem around workload mobility. You will work closely with our Cost and Platform Engineering teams to push the limits of what Kubernetes is capable of: from zero-downtime maintenance to automated node evacuation and self-healing systems.
You will shape the architecture, reliability, performance, and developer experience of Podmotion and help define how modern cloud-native systems achieve true live transparency and resiliency.
Why This Role Rocks?
You’ll work on one of the most technically ambitious features in Kubernetes today. You’ll be breaking new ground in how workloads move, recover, and self-heal in distributed environments.
You get to work deeply with Kubernetes internals, container runtimes, cgroups, Linux namespaces, and Linux kernel features.
You’ll contribute to a high-impact open-source ecosystem used by companies running large-scale production clusters.
You’ll join a team of talented engineers who love solving hard distributed-systems challenges.
You’ll see your work help thousands of teams improve reliability, reduce costs, and achieve operational excellence.
Job Type
Full-time
Career Level
Mid Level
Education Level
No Education Listed
Number of Employees
51-100 employees