As a Staff Platform Engineer - AI Infrastructure, you will be responsible for building and scaling the infrastructure behind Paytm's AI inference platform. This platform serves both internal teams and enterprise customers, supporting new customer use cases from the ground up. Your role will involve owning GPU infrastructure, model hosting and serving, and multi-model routing across various modalities. This includes managing Paytm's own coding and domain-specific models (voice, vision, risk, fintech workflows) as well as third-party models on shared GPU and accelerator clusters. You will also develop self-service platforms that enable teams to provision compute, deploy, customize models, and manage resources through APIs and control planes, eliminating the need to rebuild infrastructure for each AI use case. Your contributions will establish the AI control plane for Paytm Intelligence (Pi), encompassing policy-driven routing, quotas, observability, and visibility into usage and costs. This work will directly impact the speed of agent and AI feature deployment, their reliability, and the efficiency of hardware utilization across various domains like payments, risk, fraud, collections, support, and developer experience.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior
Education Level
No Education Listed