CAE-posted 7 days ago
Full-time • Mid Level
Tampa, FL
5,001-10,000 employees

About This Role Who We Are: CAE Vision: Our vision is to be the worldwide partner of choice in defense and security, and civil aviation by revolutionizing our customers’ training and critical operations with digitally immersive solutions to elevate safety, efficiency and readiness. CAE Defense & Security Mission: CAE's Defense and Security business unit focuses on helping prepare military customers to develop and maintain the highest levels of mission readiness. CAE Values: Empowerment, Innovation, Excellence, Integrity and OneCAE make us who we are and we strive to make a difference in the world while helping each other succeed. What We Have to Offer: Comprehensive and competitive benefits package and flexibility that promotes work-life balance A work environment where all employees are valued, respected and safe Freedom to succeed by enabling team members to deliver, take initiatives and make decisions Recognition, professional development, advancement and having fun! Summary Essential Duties and Responsibilities Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions. Design, develop, and deploy machine learning models for real-world applications. Build scalable data pipelines and model training workflows using modern tools and frameworks. Conduct rigorous model evaluation, validation, and performance tuning. Monitor and maintain deployed models, ensuring reliability and accuracy over time. Design, fine-tune, and deploy LLMs (LLaMA, Mistral, etc.) for various NLP tasks such as summarization, question answering, semantic search, and chatbots. Develop scalable and efficient model serving infrastructure using tools like ONNX, TensorRT, DeepSpeed, or vLLM. Implement retrieval-augmented generation (RAG) pipelines using vector databases (e.g., FAISS, Weaviate, Pinecone, Milvus). Optimize LLM inference for latency, throughput, and cost across cloud and edge environments. Collaborate with cross-functional teams to understand business requirements and translate them into ML solutions. Stay current with the latest research and trends in machine learning, AI & LLMs. Mentor junior engineers and contribute to team knowledge sharing. Document processes, models, and decisions for transparency and reproducibility.

  • Design, develop, and deploy machine learning models for real-world applications.
  • Build scalable data pipelines and model training workflows using modern tools and frameworks.
  • Conduct rigorous model evaluation, validation, and performance tuning.
  • Monitor and maintain deployed models, ensuring reliability and accuracy over time.
  • Design, fine-tune, and deploy LLMs (LLaMA, Mistral, etc.) for various NLP tasks such as summarization, question answering, semantic search, and chatbots.
  • Develop scalable and efficient model serving infrastructure using tools like ONNX, TensorRT, DeepSpeed, or vLLM.
  • Implement retrieval-augmented generation (RAG) pipelines using vector databases (e.g., FAISS, Weaviate, Pinecone, Milvus).
  • Optimize LLM inference for latency, throughput, and cost across cloud and edge environments.
  • Collaborate with cross-functional teams to understand business requirements and translate them into ML solutions.
  • Stay current with the latest research and trends in machine learning, AI & LLMs.
  • Mentor junior engineers and contribute to team knowledge sharing.
  • Document processes, models, and decisions for transparency and reproducibility.
  • Bachelor’s or Master’s degree in Computer Science, Machine Learning, or related field. PhD is a plus.
  • 5+ years of software development experience, with at least 2 years focused on NLP or LLMs.
  • Proficiency in ML frameworks (PyTorch, TensorFlow, Scikit-learn, CUDA).
  • Good understanding of distributed systems, understanding of microservice architecture and REST APIs.
  • Strong understanding of MLOps tools and practices (MLflow, Airflow, DVC).
  • Hands-on experience with Hugging Face Transformers, LangChain, and OpenAI APIs.
  • Technology proficiency with cloud platforms (AWS, GCP, Azure), Linux, and container orchestration (Docker, Kubernetes).
  • Proven track record of deploying ML models in production environments.
  • Experience in working with SQL/NoSQL database systems such as MySQL, MongoDB or Elasticsearch.
  • Due to U.S. Government contract requirements, only U.S. citizens are eligible for this role.
  • Must comply with all company security and data protection / usage policies and procedures.
  • Personally responsible for proper marking and handling of all information and materials, in any form.
  • Shall not divulge any information, or afford access, to other employees not having a need-to-know.
  • Shall not divulge information outside company without management approval.
  • All government and proprietary information will be accessed and stored electronically on company provided resources.
  • Incumbent must be eligible for DoD Personal Security Clearance.
  • Due to U.S. Government contract requirements, only U.S. citizens are eligible for this role.
  • Experience with deep learning, NLP, computer vision, or reinforcement learning.
  • Experience with feature engineering and model interpretability techniques.
  • Knowledge of prompt engineering and prompt optimization strategies.
  • Experience with multi-modal models (e.g., combining text with image or audio inputs).
  • Familiarity with distributed training and model parallelism.
  • Experience with fine-tuning LLMs using LoRA, QLoRA, or PEFT techniques is a plus.
  • Familiarity with CI/CD pipelines and version control (Git).
  • Ability to work in a fast-paced, agile development environment.
  • Comprehensive and competitive benefits package and flexibility that promotes work-life balance
  • A work environment where all employees are valued, respected and safe
  • Freedom to succeed by enabling team members to deliver, take initiatives and make decisions
  • Recognition, professional development, advancement and having fun!
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service