About The Position

This is a 100% remote role for a Senior Python Developer / Data Engineer specializing in ML Pipelines. The position involves designing and building large-scale data pipelines, working on ETL/ELT workflows, and constructing end-to-end ML pipelines. Collaboration with data scientists to productionize ML models, feature engineering, training pipelines, and model serving are key aspects. The role also focuses on ensuring data quality, monitoring, pipeline reliability, and optimizing systems for performance, scalability, and cost. A strong emphasis is placed on contributing clean, maintainable, production-grade Python code.

Requirements

  • 8+ years of software engineering experience with Python as primary language
  • Strong background in data engineering (ETL/ELT, pipelines, data processing)
  • Hands-on experience building and maintaining ML pipelines in production environments
  • Experience with PySpark / Apache Spark
  • Experience with workflow orchestration tools like Airflow, Dagster, or Prefect
  • Good understanding of streaming/data processing systems (Kafka, Kinesis, etc.)
  • Experience working with cloud platforms (AWS / GCP / Azure)
  • Strong SQL skills and experience with data warehouses
  • Comfortable working in a distributed/remote engineering setup

Nice To Haves

  • Experience with NLP or LLM-based systems
  • Familiarity with MLOps tools like MLflow, Kubeflow, or similar
  • Experience with feature stores
  • Exposure to data privacy, PII detection, or compliance-related systems

Responsibilities

  • Design and build large-scale data pipelines for ingestion, transformation and processing
  • Work on ETL/ELT workflows handling different types of data
  • Build and maintain end-to-end ML pipelines from data preparation to deployment and monitoring
  • Collaborate with data scientists to productionize ML models
  • Work on feature engineering, training pipelines and model serving
  • Ensure data quality, monitoring and pipeline reliability
  • Optimize systems for performance, scalability and cost
  • Contribute to clean, maintainable, production-grade Python code
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service