Senior Software Engineer, Research

Sully.ai

46d

About The Position

At Sully.ai, We’re Building the Most Impactful Healthcare Company on Earth We believe that access to a great doctor is a basic human right. Today, that’s not a reality. Delays, misdiagnoses, administrative chaos, and burnout plague the system. Our Mission: One Human, One Doctor. We build AI teammates that augment clinicians — scribes, nurses, receptionists, translators — all powered by our own world-class models and deployed in real-world care. Our Traction 450+ organizations signed 16 months AI agents cut admin by ~2.8 hours daily and reduce onboarding 85%. 5M+ Clinical Tasks completed to date, serving 36+ specialties. Raised $25M from YC, Eric Yuan, Amity, Semper Virens Patented AI architecture (MedCon-1) outperforms GPT-4.5, Gemini, Claude on clinical reasoning tasks Sully requires A-players capable of 4 months = 1 year output. What You’ll Do Build and optimize core research infrastructure: evaluation pipelines, agent workflows, hallucination detectors, coding benchmarks, and research→production integrations. Design, implement, and scale agentic systems across backend, frontend, and model integrations, collaborating closely with research and co-founders. Own reliability, observability, and performance across agents (logging, tracing, instrumentation, safety checks). Ship research-proven features into production within 7 days, end-to-end. Develop shared tools, SDKs, and internal products that accelerate iteration across Research, QA, and Engineering.

Requirements

Senior-level full-stack engineering experience in React, TypeScript, and Node.js.
Proven ability to design, ship, and scale LLM-powered applications.
Expertise in API design, streaming, and CI/CD pipelines.
Strong cloud infrastructure background (AWS, GCP, or Azure).
Track record of building reliable systems with measurable performance and error budgets.

Responsibilities

Build and optimize core research infrastructure: evaluation pipelines, agent workflows, hallucination detectors, coding benchmarks, and research→production integrations.
Design, implement, and scale agentic systems across backend, frontend, and model integrations, collaborating closely with research and co-founders.
Own reliability, observability, and performance across agents (logging, tracing, instrumentation, safety checks).
Ship research-proven features into production within 7 days, end-to-end.
Develop shared tools, SDKs, and internal products that accelerate iteration across Research, QA, and Engineering.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume