Bright Vision Technologies is seeking a skilled Model Serving Engineer to join their dynamic team. This role focuses on designing, building, and operating high-performance, highly reliable inference platforms for serving large machine learning models in production. The position emphasizes the systems engineering aspects of AI deployment, including request routing, batching, caching, autoscaling, GPU utilization, and end-to-end observability across diverse model workloads. The ideal candidate will possess strong distributed systems and performance engineering expertise, have experience deploying serving systems at scale, and understand the trade-offs involved in ML serving concerning latency, throughput, cost, and quality.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior