Senior ML Engineer

Invoca
Canada, KY
$152,000 - $228,000
Remote

About The Position

Invoca is seeking a Senior ML Engineer to lead the productionization layer of its ML stack. The role owns model serving, inference optimization, fine-tuning, and the associated APIs and pipelines. The engineer will be a key contributor to the infrastructure powering Invoca's Context Engine and agentic AI workflows, collaborating closely with Data Scientists, Data Engineers, and Applied AI Engineers. The position is remote-first and based in specific US and Canadian locations.

Requirements

  • 5+ years of ML Engineering experience with a strong production focus
  • Advanced Python and deep learning proficiency (PyTorch, HuggingFace Transformers, spaCy)
  • Demonstrated track record deploying and maintaining transformer-based NLP models in production
  • Hands-on experience fine-tuning SLMs/LLMs (LoRA, QLoRA, PEFT) and optimizing models via quantization, batching, and throughput tuning
  • Proficiency with inference infrastructure: Triton, Baseten, vLLM, TGI, SageMaker, Vertex AI, or similar
  • Experience building production-grade APIs that expose ML models to downstream consumers
  • Familiarity with MLOps tooling, model monitoring, and eval platforms (Braintrust, MLflow, or equivalent)
  • B.S. in Computer Science, Engineering, Statistics, or equivalent; advanced degree a plus

Nice To Haves

  • Familiarity with RLHF or preference training

Responsibilities

  • Lead End-to-End MLOps and Productionization: Architect, implement, and maintain CI/CD pipelines for ML artifacts, including model evaluation, versioning, and automated deployment. Serve as the primary subject-matter expert (SME) for operational excellence across the Invoca ML stack.
  • Design and Optimize SLM/LLM Deployment: Own the full inference infrastructure, including model serving on Triton Inference Server, Baseten, and Kubernetes-based GPU infrastructure. Profile and tune for low latency and high throughput, and build robust, scalable APIs for internal and external model access.
  • Fine-Tune Language Models: Apply parameter-efficient fine-tuning methods (LoRA, QLoRA, PEFT) to adapt transformer-based SLMs and LLMs for high-impact NLP applications in conversation intelligence.
  • Evolve ML Infrastructure: Contribute to model training infrastructure, data pipelines, and data lake foundations to keep the systems powering our models reliable and scalable.
  • Collaborate Across Teams: Partner closely with Data Scientists, Data Engineers, and Applied AI Engineers to build the foundational ML systems behind Invoca's agentic AI products.
  • Deliver Customer Value: Work with product and engineering to understand customer needs and ship ML solutions that make a measurable difference.

Benefits

  • Flexible Time Off
  • Paid Holidays (16 U.S. paid holidays, including a winter break)
  • Health Benefits (medical, dental, and vision coverage, with multiple plan options)
  • Fertility assistance
  • 401(k) plan through Fidelity with a company match of up to 4%
  • Stock Options
  • Mental Health Program (SpringHealth)
  • Paid Family Leave (up to 6 weeks of 100% paid leave for baby bonding, adoption, and caring for family members)
  • Paid Medical Leave (up to 12 weeks of 100% paid leave for childbirth and medical needs)
  • InVacation (bonus after 7 years of service)
  • Wellness Subsidy (for gym memberships, fitness classes, and more)