Inception creates the world’s fastest, most efficient AI models. Our Mercury model is the world’s fastest reasoning LLM and first commercially available diffusion LLM, delivering 5x greater speed and efficiency than today’s LLMs, with best-in-class quality. We are the AI researchers and engineers behind such breakthrough AI technologies as diffusion models, flash attention, and DPO. We seek experienced Full Stack engineers to build the infrastructure and applications for our cutting-edge AI. In this role, you will design and develop both frontend and backend systems that enable users to interact with our diffusion LLMs, creating intuitive interfaces and robust APIs capable of serving billions of requests per day. You'll work closely with our ML engineers and researchers to bridge the gap between complex AI models and user-friendly applications.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level