About The Position

We are looking for an experienced full-stack Software Engineer to join our AI infrastructure team and help build the next generation of AI capabilities for our customer. This is not a junior role — you'll be expected to independently design, implement, and operate scalable AI infrastructure components from day one, and to bring real technical leadership to everything you touch. In this role, you will be at the center of a platform that powers production AI services, including retrieval augmented generation (RAG), autonomous agents, and other emerging technologies. You'll help ensure that platform is fast, reliable, observable, and secure — and you'll help the engineers around you level up while you're at it.

Requirements

  • 8 years of relevant experience, or 4 additional years of experience in lieu of a B.S. in a technical discipline
  • Proven experience building and maintaining production systems at scale
  • Experience with high-volume web application architecture and performance optimization
  • Strong background in systems integration across diverse technologies and platforms
  • Hands-on experience with cloud engineering in AWS
  • Proficiency with Kubernetes administration and deployment patterns
  • Strong Python programming skills
  • Experience implementing observability solutions (APM, OpenTelemetry, Grafana, Prometheus)
  • Familiarity with CI/CD pipelines and DevOps practices
  • Strong change management and organizational influence skills
  • Ability to thrive in ambiguous environments and create structure where needed
  • Excellent communication and collaboration skills
  • An active TS/SCI clearance with polygraph

Nice To Haves

  • Experience with AI inference serving technologies such as vLLM or LiteLLM
  • Previous experience with agentic frameworks such as LangChain
  • Knowledge of vector databases and embedding systems
  • Experience with high-performance computing or distributed systems

Responsibilities

  • Designing, implementing, and optimizing infrastructure for AI model inference at scale
  • Supporting the development and maintenance of production AI services and applications, including RAG pipelines, autonomous agents, and emerging AI technologies
  • Navigating ambiguity and defining solutions for underspecified systems and requirements
  • Driving adoption of new technologies and engineering practices across the team
  • Implementing monitoring, logging, and observability solutions for AI services
  • Automating infrastructure provisioning and configuration using Infrastructure as Code (IaC) principles
  • Ensuring high availability, reliability, and performance of AI platform components
  • Contributing to security best practices for AI systems and data
  • Providing technical guidance and informal mentorship to junior engineers

Benefits

  • Top salaries
  • Pick your PTO (3 to 5 weeks)
  • All 11 federal holidays, paid
  • Up to 2 snow days, paid
  • 4x match on the first 6% of 401(k) contributions (up to 24% company match)
  • 100% employer-paid medical, dental, vision, life, and disability insurances (or salary boost if already covered)
  • $5,250 annual education assistance
  • Spot bonuses
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service