About The Position

At Sully.ai, We’re Building the Most Impactful Healthcare Company on Earth We believe that access to a great doctor is a basic human right. Today, that’s not a reality. Delays, misdiagnoses, administrative chaos, and burnout plague the system. Our Mission: One Human, One Doctor. We build AI teammates that augment clinicians — scribes, nurses, receptionists, translators — all powered by our own world-class models and deployed in real-world care. Our Traction 450+ organizations signed 16 months AI agents cut admin by ~2.8 hours daily and reduce onboarding 85%. 5M+ Clinical Tasks completed to date, serving 36+ specialties. Raised $25M from YC, Eric Yuan, Amity, Semper Virens Patented AI architecture (MedCon-1) outperforms GPT-4.5, Gemini, Claude on clinical reasoning tasks Sully requires A-players capable of 4 months = 1 year output. What You’ll Do Build and optimize core research infrastructure: evaluation pipelines, agent workflows, hallucination detectors, coding benchmarks, and research→production integrations. Design, implement, and scale agentic systems across backend, frontend, and model integrations, collaborating closely with research and co-founders. Own reliability, observability, and performance across agents (logging, tracing, instrumentation, safety checks). Ship research-proven features into production within 7 days, end-to-end. Develop shared tools, SDKs, and internal products that accelerate iteration across Research, QA, and Engineering.

Requirements

  • Senior-level full-stack engineering experience in React, TypeScript, and Node.js.
  • Proven ability to design, ship, and scale LLM-powered applications.
  • Expertise in API design, streaming, and CI/CD pipelines.
  • Strong cloud infrastructure background (AWS, GCP, or Azure).
  • Track record of building reliable systems with measurable performance and error budgets.

Responsibilities

  • Build and optimize core research infrastructure: evaluation pipelines, agent workflows, hallucination detectors, coding benchmarks, and research→production integrations.
  • Design, implement, and scale agentic systems across backend, frontend, and model integrations, collaborating closely with research and co-founders.
  • Own reliability, observability, and performance across agents (logging, tracing, instrumentation, safety checks).
  • Ship research-proven features into production within 7 days, end-to-end.
  • Develop shared tools, SDKs, and internal products that accelerate iteration across Research, QA, and Engineering.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service