Staff AI Engineering

basalt.health
Remote

About The Position

We're building tools that help healthcare teams work faster and smarter. Our software handles the tedious parts of patient care coordination so that clinicians can focus on patients. We ship fast, iterate constantly, and believe AI should amplify human capability, not replace it.

You'll own the intelligence layer of our platform: the orchestration systems that coordinate LLM calls, the retrieval pipelines that surface relevant medical history, and the evaluation frameworks that keep it all honest.

You're a generalist who can and will ship across the stack, but you geek out on AI infrastructure. You have opinions on prompt management, you've debugged token limits at 2am, and you know that "it works in the playground" means nothing. You push the limits of your bot swarm further every day to maximize your impact, and you're excited for the future.

Requirements

  • Hands-on LLM experience. You've built with OpenAI, Anthropic, or Google APIs. You understand context windows, temperature, and when to use which model.
  • Orchestration chops. LangChain, LangGraph, or similar. You know how to chain calls, handle failures, and manage state.
  • Observability instincts. You've used Langfuse, LangSmith, or Phoenix. You know that production AI without tracing is flying blind.
  • Retrieval/RAG experience. You've built vector search with Pinecone, Weaviate, pgvector, or Vertex AI Matching Engine. You understand chunking strategies, embedding models, and reranking.
  • Generalist coding ability. You can write Python for ML pipelines and TypeScript for services. You're not afraid of infrastructure, and you're never too absorbed in the exciting work to pitch in on the boring stuff.

Nice To Haves

  • Fine-tuning experience (LoRA, PEFT, or full fine-tunes)
  • Healthcare/FHIR/clinical data background
  • Experience with Gemini models and Vertex AI
  • Evaluation frameworks (RAGAS, custom evals, human-in-the-loop)
  • You've made embeddings work on messy, real-world data

Responsibilities

  • Improve our LangGraph orchestration to handle complex clinical workflows
  • Build retrieval pipelines that search patient records using embeddings and vector similarity
  • Set up Langfuse/LangSmith tracing to debug why a summarization chain is hallucinating
  • Fine-tune a model on medical terminology or clinical note structure
  • Evaluate Gemini vs Claude vs GPT for specific healthcare tasks
  • Design the data pipeline that turns unstructured clinical docs into searchable vectors reliably and responsibly
  • Write the TypeScript or Python services that tie it all together

Benefits

  • Competitive salary + equity
  • Health insurance (we're a healthcare company—we get it)
  • Remote-first, async-friendly
  • Small team where your work ships to production, not a backlog
© 2024 Teal Labs, Inc