4Minds is an enterprise AI fine-tuning platform that transforms how organizations build and operate private, domain-specific AI. Unlike static systems, 4Minds’s AI platform learns continuously from live data in real time and can be deployed on-prem or your cloud provider. Our patented technologies scale existing engineering teams and empower new AI teams, enabling rapid AI deployment, adaptation, and ROI. Through 4Minds’s automated data pipeline and proprietary knowledge graph, enterprises can connect all their data sources, including Microsoft, Databricks, AWS and Google, creating adaptive AI that surpasses the capabilities of conventional RAG-based systems. As Machine Learning Ops Engineer at 4Minds, you will own the infrastructure that makes our AI platform perform, scale, and ship across the most demanding deployment environments in the enterprise market: GCP, AWS, Azure, CoreWeave, and on-premise. This isn't a role where you maintain what others built. You'll actively research, evaluate, and drive improvements across every layer of the stack, from inference pipeline reliability to GPU performance optimization across hardware architectures. Working in close partnership with the CTO, you'll take on initiatives that sit at the frontier of what's possible with modern AI infrastructure. Our platform's ability to deploy privately, on-premise or in any cloud, is a core product promise, and you're the engineer who makes that promise real at scale. This is a senior, hands-on role on a focused engineering and research team. You'll bring production discipline to a system that demands it, while continuously pushing the boundaries of how we scale, optimize, and extend our infrastructure as the platform grows.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior