Senior Software Engineer - Infrastructure, Machine Learning

Baton (A Ryder Technology Lab)
San Francisco, CA
$200,000 - $250,000

About The Position

As a Senior Software Engineer within our Machine Learning Team, you will tackle complex challenges in distributed systems and ML operations to enhance our machine learning infrastructure. You’ll build scalable ML infrastructure from the ground up - supporting model deployment, distributed training, real-time inference, and more. You’ll be a key partner to the Data Science team, helping bring value to production quickly and reliably. This role requires a blend of advanced Python programming skills within production environments and expertise in distributed computing.

Requirements

  • Advanced Python proficiency in large-scale production environments.
  • Experience building scalable backend or ML infrastructure using distributed computing techniques.
  • Strong background in AWS and cloud-native data/compute services.
  • Hands-on experience with distributed training pipelines, model serving, and monitoring.
  • Deep familiarity with SQL (OLTP & OLAP), feature engineering, and caching patterns.

Nice To Haves

  • 5 to 8 years of backend or ML infrastructure experience.
  • Proven track record building production ML workflows at scale.
  • Experience in the logistics, transportation, or freight industry is a bonus.

Responsibilities

  • Build and scale robust distributed systems for ML training, serving, inference, and operational deployment.
  • Design, implement, and improve real-time ML workflows that power core product features, support dynamic decision-making, and automate core operational processes.
  • Architect, streamline, and manage scalable online and offline feature stores, optimizing feature engineering processes for greater efficiency.
  • Lead the development of MLOps systems, including model deployment, monitoring, and experiment tracking.
  • Contribute to agentic AI systems for freight matching, ETA prediction, and load scheduling.
  • Support systems that improve Stop Estimation Accuracy and Cross-Mode Optimization.
  • Write production-grade Python that operates at scale, with reliability and performance top of mind.
  • Collaborate across engineering and data science to turn models into resilient software systems.

Benefits

  • Competitive Base Salary
  • Long Term Cash Incentive Plans
  • Annual Company Bonus
  • 401(k) with Matching
  • Hybrid Work Schedule
  • Comprehensive Health Coverage
  • Hyper-Stable, Publicly Traded Enterprise
  • Employee Stock Purchase Program (15% discount to market value)
  • Collaborative, Tech-Forward, Cozy Office Environment in Hayes Valley