At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI deployments. As leading developers, maintainers of the vLLM project, and inventors of state-of-the-art techniques for model compression, our team provides a stable platform for enterprises to build, optimize, and scale LLM deployments. We are seeking an experienced Senior Software engineer to work closely with our technical and research teams on vLLM, llm-compressor, speculators, llm-d, create DevOps and CI/CD infrastructure, and scale our current technology stack. If you are someone who wants to contribute to solving challenging technical problems at the forefront of AI Inference, this is the role for you! You would be joining the core team behind 2025's most popular open source project on GitHub. In this role, your primary responsibility will be to build and release the Red Hat AI Inference Server, continuously improve the processes and tooling used by the DevOps team, and find opportunities to automate procedures and tasks. Join us in shaping the future of AI!
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level