We're seeking a Machine Learning Engineer to join our Developer Experience team and own the operational deployment and performance optimization of our AI coding infrastructure. You'll be the expert who ensures our code generation models run reliably and efficiently at scale, powering the systems that help developers write software.

You'll work across the full model lifecycle. This includes fine-tuning open-source models for code generation tasks, implementing RLHF pipelines to improve code quality and align with developer workflows, and then deploying those customized models at scale. You'll evaluate and test bleeding-edge code models as they're released, debug distributed inference and serving frameworks like vLLM, SGLang, and Ray, resolve GPU memory allocation issues, manage CUDA dependencies and kernel compatibility, and navigate the ever-shifting landscape of ML library ecosystems.

Your primary focus will be maximizing the throughput and capacity we can extract from our GPU infrastructure, turning experimental code models and fine-tuned variants into production-ready systems that generate code for thousands of developers.
Job Type: Full-time
Career Level: Mid Level
Industry: Publishing Industries
Education Level: Bachelor's degree
Number of Employees: 5,001-10,000 employees