Multimodal Analysis Framework (MAF) is an end‑to-end platform designed to process diverse content sources—including video, images, audio, and documents—to generate rich, structured metadata. The platform unifies multiple ML/AI models to extract curated insights at scale, tailored to specific business needs. MAF supports both on‑demand workloads (batch uploads, ad‑hoc analysis) and real‑time streaming workflows, enabling continuous metadata generation for live content streams. Customers can define their metadata requirements—such as entity extraction, scene segmentation, object detection, transcription, summarization, or multimodal correlation—and the framework orchestrates the appropriate models and toolchains to deliver high‑quality outputs. Through flexible APIs and UI‑based workflows, customers and internal teams can visualize metadata, trigger enrichment, monitor processing, and integrate results into downstream applications. The platform emphasizes modularity, scalability, and extensibility to support new ML models, LLM‑based agents, and cross-modal inference as use cases evolve. We are looking for a mid-level Backend Engineer to join our Machine Learning Platform team. This role focuses on building scalable backend systems that power ML workloads, including video, image, and document processing, and enable LLM-driven applications through agents and MCP servers. You will work primarily in Golang, deploy and operate services on Kubernetes, manage infrastructure with Terraform, and build on AWS. A core part of the role is designing platform capabilities that allow LLMs to safely and reliably interact with tools, data, and services via agent frameworks and MCP servers.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level