Thomson Reuters is seeking a Lead Inference Platform Engineer. This role is for someone who has specialized experience in machine learning/deep learning domains such as model compression, hardware aware model optimizations, hardware accelerators architecture, GPU/ASIC architecture, machine learning compilers, high performance computing, performance optimizations, numerics or SW/HW co-design. As a Lead Inference Platform Engineer, you will optimize LLMs and ML models for high-performance inference, deploy and scale inference workloads on GPUs across AWS, Azure, GCP and internal Kubernetes clusters, implement routing and failover strategies for OpenAI/Anthropic/Vertex AI traffic, integrate models into production grade APIs supporting TR products and enterprise workflows, develop highly optimized environments and eliminate performance bottlenecks to reduce latency, collaborate with Platform Engineering teams to ensure inference workloads align with TR’s cloud native patterns, build and optimize containerized inference pipelines using Kubernetes for large‑scale distributed workloads, ensure compliance with TR’s AI standards for deployment, monitoring, governance, and drift detection, profile inference performance, identify GPU/CPU bottlenecks, and optimize compute utilization across heterogeneous hardware, implement observability and health monitoring for inference pipelines, ensuring reliability of enterprise AI services, collaborate with platform teams to enhance capacity forecasting for AI workloads, work with Product, Data Science, Architecture, and Enterprise AI teams to onboard new research models into production, and collaborate closely with AI engineers to invent new quantization techniques, improve numerical precision, and explore non‑standard architectures. You will also partner with Cloud Engineers to develop guardrails and automation that support inference workload, and support the scale out of AI infrastructure during critical releases and global product rollouts.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior
Education Level
No Education Listed