Member of Technical Staff, Full Stack

Inception • San Francisco, CA

About The Position

Inception creates the world’s fastest, most efficient AI models. Our Mercury model is the world’s fastest reasoning LLM and first commercially available diffusion LLM, delivering 5x greater speed and efficiency than today’s LLMs, with best-in-class quality. We are the AI researchers and engineers behind such breakthrough AI technologies as diffusion models, flash attention, and DPO.

We seek experienced Full Stack engineers to build the infrastructure and applications for our cutting-edge AI. In this role, you will design and develop both frontend and backend systems that enable users to interact with our diffusion LLMs, creating intuitive interfaces and robust APIs capable of serving billions of requests per day. You'll work closely with our ML engineers and researchers to bridge the gap between complex AI models and user-friendly applications.

Requirements

BS/MS/PhD in Computer Science, Machine Learning, or a related field (or equivalent experience)
5+ years of experience building production web applications
Strong proficiency in modern JavaScript/TypeScript and Python
Experience with frontend frameworks (e.g., React) and state management solutions
Solid understanding of backend development, including API design, database management, and microservices architecture
Experience with SQL and NoSQL databases (e.g., Neon)
Familiarity with Kubernetes, CI/CD pipelines, and cloud infrastructure (AWS and/or Azure)
Experience with version control (Git) and collaborative development workflows
Strong problem-solving skills and the ability to work in a fast-paced startup environment

Nice To Haves

Experience building applications that integrate with AI/ML systems
Knowledge of streaming architectures and real-time data processing
Experience with monitoring and observability tools (e.g., Prometheus and Grafana)
Understanding of ML concepts and experience with ML frameworks (PyTorch, TensorFlow)
Experience with infrastructure as code tools (e.g., Terraform)
Experience with testing frameworks and test-driven development
Experience with UI/UX design and Framer

Responsibilities

Design and develop scalable web applications and APIs for our models, building both frontend interfaces and backend services
Build systems for internal experimentation and monitoring that provide real-time observability of our entire tech stack
Develop RESTful APIs to expose model capabilities to external applications
Implement authentication, authorization, and security best practices for enterprise deployments
Collaborate with ML engineers to integrate model serving infrastructure with the application layer
Design infrastructure as code, deployment automation, and CI/CD pipelines