Intelliswift Software-posted 3 months ago
Senior
Santa Clara, CA
Professional, Scientific, and Technical Services

As a Senior Software Engineer, you will be responsible for developing and maintaining the infrastructure required to deploy, monitor, and manage machine learning models efficiently and effectively. This role is focused on building ML-Ops solutions, but general software engineering skills are sufficient. The work is critical in bridging the gap between research and engineering, ensuring that our AI solutions are scalable, reliable, and seamlessly integrated into our products. This role requires you to thrive in a fast-paced environment, be passionate about AI/ML, and be constantly looking for ways to optimize and automate machine learning workflows.

  • Implement, optimize, and maintain CI/CD pipelines for ML systems, including integrations with GitHub workflows and Jenkins.
  • Partner with data scientists, frontend engineers, and platform teams to deliver seamless integration of ML models into core evaluation platforms.
  • Administer ML development/production environments using cloud-native solutions; optimize for scalability, reliability, and cost.
  • Evaluate, build, and deploy automation tools to streamline the end-to-end ML lifecycle.
  • Enhance and develop quality evaluation features and ensure robust monitoring via dashboards and automated alerts.
  • Champion engineering best practices, promote code quality, and document workflows, tools, and processes for effective team adoption.
  • Master's in computer science or related STEM field.
  • Minimum 5 years in software engineering; at least 2 years dedicated to DevOps/MLOps in cloud and production environments.
  • Industry experiences building end-to-end software pipelines and infrastructure with deep experience with Kubernetes, Infrastructure as Code (Terraform, CloudFormation), AWS, and GCP.
  • Expert proficiency in Python; working knowledge of ML frameworks (e.g., PyTorch, TensorFlow, MLflow).
  • Practical experience with cloud and NoSQL databases such as DynamoDB; SQL databases a plus.
  • Skilled with GitHub Actions, Jenkins, GitLab CI, Docker, and related automation platforms.
  • Strong problem-solving skills and the ability to work collaboratively across teams.
  • Strong knowledge of ML-Ops.
  • Exposure to Computer Vision, Generative AI (GAN, CLIP, Diffusion, MLLM), and their practical deployment for evaluation systems.
  • Experience in integrating ML workflows with user-facing features and backend pipelines.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service