As a Principal Engineer on the Enterprise AI/ML Platform team, you will tackle the most complex technical challenges involved in delivering machine learning at enterprise scale. You will design, build, and evolve reliable, secure, and cost‑efficient platform capabilities—from model packaging and serving to observability and lifecycle management—working closely with multiple teams to ensure these capabilities are practical, robust, and widely usable in production. You will take a hands‑on role across enterprise repositories, improving shared services, CI/CD workflows, and infrastructure patterns where they have the greatest impact. This includes deep technical investigation of performance and scalability issues, such as tracking down bottlenecks in web services, analyzing system and application metrics, and optimizing GPU utilization, throughput, and resource efficiency across ML workloads.
Job Type: Full-time
Career Level: Principal
Number of Employees: 101-250