HiringCafe is building a job search engine that aims to be 100x better than existing platforms like Indeed and LinkedIn. The company is looking for a founding ML engineer to turn AI and ML models into efficient, reliable production systems. The role focuses on deploying models, optimizing inference latency and throughput, scaling serving systems, and keeping models running efficiently in production. It is a hands-on engineering position for people passionate about model performance, GPU utilization, inference architecture, and production reliability.
Job Type: Full-time
Career Level: Senior
Education Level: No Education Listed