Anthropic is seeking an Engineering leader to lead the API Core team within API. API Core owns the hot path of the Claude API —/v1/messages and the request lifecycle that sits in front of every inference call Anthropic serves. As Claude's usage continues to scale, the efficiency, throughput, and reliability of the core API directly determine how much capacity we can deliver per chip, how quickly we can onboard new workloads, and how fairly we allocate constrained inference resources across customers. The work spans service-level efficiency (improvements of latency-sensitive paths), throughput scaling (RPS improvements, request multiplexing, connection management), rate limiting and acceleration limits (the quota and fairness systems that mediate access to compute), and the foundational platform abstractions that the rest of the API organization builds on. The team operates at the intersection of product engineering and infrastructure, partnering deeply with Inference, Compute, and the broader Platform org to translate model-serving capacity into customer-facing throughput. This is a high-impact, high-visibility leadership role reporting to the Head of API Engineering. You will set technical direction, drive delivery against committed efficiency and capacity targets, and represent API Core in cross-org architecture and capacity-planning forums.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Manager