Senior LLM / RAG Engineer

PeratonReston, VA

About The Position

We are looking for a Senior‑Level Engineer to lead the development and sustainment of Retrieval‑Augmented Generation (RAG) AI prototypes for a national‑security mission. You will work directly with customer stakeholders to expand an existing prototype and deliver new LLM‑powered capabilities that help analysts understand and act on large volumes of proprietary data. In this role, you will combine data engineering, model serving, GPU‑based inference, and rapid application development to deliver high‑impact AI tools in a secure environment.

Requirements

  • Bachelor’s degree in an area relevant to the position with 12+ years of applicable experience OR a Master’s degree in an area relevant to the position with 10 years of applicable experience; an additional 4 years of applicable experience maybe considered in lieu of a degree.
  • Active TS/SCI or SCI eligibility and active polygraph or ability to obtain a polygraph
  • Strong hands‑on experience with AWS services (EC2, S3, IAM, container services)
  • Expertise with Python and Linux in operational environments
  • Ability to design, deploy, and manage Docker‑based workloads
  • Experience with GPU inference, model serving, and LLM fundamentals
  • Experience working with structured, semi‑structured, and unstructured data
  • Familiarity with CI/CD basics (Git, Jenkins, etc.)

Nice To Haves

  • Streamlit or similar tools for rapid UI development
  • Experience with vector stores (Milvus or comparable)
  • Familiarity with embedding generation and RAG pipeline tooling
  • Experience with sglang, Ray Serve, LlamaIndex, Hugging Face, or similar frameworks
  • AWS certifications
  • Knowledge of prompt engineering and evaluation best practices

Responsibilities

  • Maintain and extend current RAG prototypes to integrate new datasets and features
  • Build and optimize data ingest pipelines using Python and AWS services
  • Develop LLM/embedding pipelines and operate GPU inference workloads
  • Deploy and manage containerized services in Kubernetes‑like or Docker‑based environments
  • Implement vector search solutions using modern vector databases
  • Develop mission‑focused UIs using Streamlit or similar tools for rapid prototyping
  • Use tools such as sglang, Ray Serve, and LlamaIndex to operationalize LLM capabilities
  • Collaborate closely with analysts and mission leaders to understand use cases and rapidly iterate on prototypes
  • Ensure solutions follow customer security, compliance, and operational guidelines
  • Write technical documentation for deployments, APIs, and system behaviors

Benefits

  • Heavily subsidized employee benefits coverage for you and your dependents
  • 25 days of PTO accrued annually up to a generous PTO cap
  • Eligible to participate in an attractive bonus plan
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service