The F3I-3 program at CACI is seeking an AI Platform Engineer to join their team and help solve one of their customer’s toughest problems. This role involves designing, architecting, and leading the development of AI/ML platform components and agentic AI applications. The engineer will implement and optimize AI/ML algorithms, write production-quality code primarily in Python (potentially Go, C, C++, or Rust), and deploy container-based applications using CI/CD best practices and MLOps infrastructure on platforms like Red Hat OpenShift and public clouds (AWS, GCP, Azure). The position requires staying current with Generative AI advancements, performing root cause analysis, and solving complex problems in a dynamic environment. A key responsibility is architecting and maintaining data pipelines for training data, model artifacts, and inference logs within a governed data lake. The role also includes designing, implementing, and operating a unified MLOps platform for both on-premises and cloud-hosted Kubernetes clusters, enabling rapid onboarding of new Agentic AI services and ensuring consistent governance. Collaboration with research scientists, data scientists, product teams, and stakeholders is essential to translate prototypes into production-grade services, ensuring reproducibility, security, and compliance. Mentoring junior engineers and contributing to knowledge bases are also part of the role. Performance optimization for inference workloads (GPU/CPU scaling, model quantization, batching strategies) and championing best practices in security, cost efficiency, and disaster recovery for hybrid infrastructure are critical.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior