As a software engineer for GenAI inference, you will help design, develop, and optimize the inference engine that powers Databricks' Foundation Model API. You'll work at the intersection of research and production, ensuring our large language model (LLM) serving systems are fast, scalable, and efficient. Your work will span the full GenAI inference stack, from kernels and runtimes to orchestration and memory management.
Industry
Professional, Scientific, and Technical Services
Number of Employees
5,001-10,000 employees