ML Engineer

Performacentric
Indianapolis, IN
Remote

About The Position

Performacentric helps small and mid-market organizations improve profitability, efficiency, visibility, employee performance, customer satisfaction, and supplier performance through custom AI agents, intelligent automation, and connected business systems. We are building a next-generation AI platform powered by open-source large language models, agentic workflows, and business process automation. We are seeking a Machine Learning Engineer to help design, deploy, and optimize AI solutions built on Llama models and modern Python-based application architectures.

Requirements

  • 3+ years of professional software engineering experience.
  • Strong proficiency in Python.
  • Experience building APIs with FastAPI.
  • Experience deploying and working with Llama 3 8B or similar open-source LLMs.
  • Understanding of prompt engineering and LLM optimization techniques.
  • Experience consuming and developing REST APIs.
  • Strong understanding of Git-based development workflows.
  • Familiarity with Linux environments and command-line tools.
  • Experience troubleshooting and optimizing production applications.
  • Understanding of machine learning fundamentals.
  • Experience evaluating AI model performance.
  • Familiarity with embeddings, vector search, and RAG architectures.
  • Knowledge of model inference optimization techniques.
  • Experience working with structured and unstructured datasets.

Nice To Haves

  • Fine-tuning open-source LLMs.
  • ML engineering and MLOps practices.
  • LangChain, LlamaIndex, Haystack, or similar frameworks.
  • PostgreSQL database administration and optimization.
  • Vector databases such as pgvector, Chroma, Pinecone, Weaviate, or Qdrant.
  • Docker and containerized deployments.
  • Kubernetes orchestration.
  • Azure AI infrastructure and GPU environments.
  • CI/CD pipelines and DevOps automation.
  • Multi-agent AI architectures.
  • Knowledge graph implementations.
  • Business intelligence and analytics platforms.

Responsibilities

  • Deploy, configure, and optimize Llama 3 8B models for production use.
  • Develop prompt engineering, retrieval, and agentic workflows.
  • Fine-tune LLMs and evaluate their performance for business use cases.
  • Implement Retrieval-Augmented Generation (RAG) architectures.
  • Optimize inference performance, latency, and infrastructure utilization.
  • Monitor model quality and continuously improve response accuracy.
  • Build scalable AI applications using Python and FastAPI.
  • Design and maintain RESTful APIs for AI services.
  • Develop backend services supporting AI agents and copilots.
  • Integrate AI solutions with CRM, ERP, communication, and business systems.
  • Implement authentication, authorization, and API security controls.
  • Write clean, maintainable, and well-documented code.
  • Build and maintain vector database integrations.
  • Develop data ingestion and preprocessing pipelines.
  • Support deployment of AI workloads in cloud and self-hosted environments.
  • Collaborate on model serving, monitoring, logging, and observability.
  • Assist with infrastructure automation and CI/CD processes.
  • Work closely with product, engineering, and leadership teams.
  • Participate in architecture discussions and technical planning.
  • Contribute to AI solution design for client implementations.
  • Mentor junior developers and share best practices.

Benefits

  • Remote-first work environment.
  • Competitive compensation based on experience.
  • Professional growth opportunities in one of the fastest-growing areas of software development.
  • Ability to help shape the future of AI-powered business transformation.