AI Engineer

Ethos Life
7d$146,000 - $236,000Remote

About The Position

We’re building several LLM-powered copilots across critical workflows (e.g., underwriting productivity, agent enablement, customer support, operations/compliance, fraud). We need an AI engineer to own the LLM + retrieval + context layer that makes these copilots accurate, auditable, fast, and cost-efficient. Typical stack: Python/FastAPI, Postgres + vector (pgvector/Pinecone/Weaviate), OpenSearch, optional graph DB, Kubernetes + GPUs, OTEL/Datadog

Requirements

  • 5+ years building production systems; 2+ years hands-on LLMs/RAG
  • Proven RAG experience (embeddings, vector DBs, hybrid search, reranking, eval)
  • Strong backend/distributed systems + observability
  • Track record shipping in high-stakes environments with auditability/correctness

Nice To Haves

  • Knowledge graph / entity resolution / provenance systems
  • GPU inference optimization (vLLM/TGI/TensorRT-LLM, quantization AWQ/GPTQ, batching)
  • Regulated domain experience (insurance/fintech/healthcare)

Responsibilities

  • Production RAG: indexing, retrieval, hybrid search, reranking, query rewriting, grounding, citations
  • Context Graph: entity resolution + linking + provenance; graph + vector retrieval; supports multi-hop context
  • LLM orchestration: tool/function calling, structured outputs, routing across model tiers, failure modes
  • GPU/inference cost optimization: batching, caching/KV reuse, quantization, autoscaling; optimize $/session + latency
  • Safety + compliance: PII/PHI handling, redaction, audit logs, deterministic replay, hallucination mitigation
  • LLMOps: eval harness (golden sets, regression, adversarial), monitoring for quality/cost/drift
  • Design/ship the end-to-end pipeline: retrieve → assemble context → generate → cite → log/monitor
  • Improve quality and trust via evaluation, feedback loops, and clear evidence-backed outputs
  • Partner with product, security, and domain teams; write crisp design docs; raise engineering bar
  • Ship RAG v1 with citations + measurable quality metrics
  • Deliver Context Graph v1 that improves retrieval on real copilot tasks
  • Reduce cost/latency with a concrete inference optimization plan shipped to prod

Benefits

  • The US national base salary range for this full-time position is $146,000 - $236,000.
  • Our salary ranges are determined by role, level, and location.
  • The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position across all US locations.
  • Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training.
  • Please note that the compensation details listed in US role postings reflect the base salary only and do not include applicable bonus, equity, or benefits.
  • You can find further details of our US benefits at https://www.ethoslife.com/careers/

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Education Level

No Education Listed

Number of Employees

251-500 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service