Principal Research Data Engineer

Bayer
St. Louis, MO
$142,000 - $185,000
Remote

About The Position

The Principal Research Data Engineer, based in St. Louis, MO, oversees the development and implementation of research data pipelines that produce data layers and store research data. The role implements and maintains scalable, data-intensive processing pipelines that apply geospatial data to ML/DL models; architects, builds, and launches new data models that give business users intuitive analytics; and develops infrastructure to report on key metrics, recommend changes, and predict future results. It also develops proofs of concept (POCs) for new pipelines to be integrated into the science data pipeline, in collaboration with diverse research partners.

Requirements

  • Master’s in Information Science, C.S., Data Science, Data Analytics, or closely related field
  • 5 years of experience designing, developing, testing, and implementing scalable geospatial data integration pipelines that encompass statistical yield analysis and interactive report and visualization generation
  • Working with raster & vector geospatial datasets applied to machine learning model generation and deployment in big data environment
  • Packaging & deploying models and data pipelines using CI/CD practices, including production readiness and performance tuning activities using Python and/or Conda, Docker, Airflow, and Git CI/CD
  • Using Google Cloud Platform, Google Cloud Functions, Google BigQuery, and Dataproc to process data at scale and deliver robust data pipelines
  • Using Avro, Parquet, CSV, GeoTIFF, and GeoJSON file formats
  • Programming in SQL
  • Conducting query optimization & Online Analytical Processing (OLAP) on RDBMS and NoSQL databases
  • Using QGIS, ArcGIS & PostGIS to ingest and process geospatial data in Avro, CSV, and GeoJSON formats

Responsibilities

  • Oversee the development & implementation of research data pipelines for producing data layers and storing research data
  • Implement & maintain scalable data-intensive processing pipelines that apply geospatial data to ML/DL models
  • Architect, build & launch new data models to provide intuitive analytics to business users
  • Develop infrastructure to inform on key metrics, recommend changes & predict future results
  • Develop POCs for new pipelines for integration into science data pipeline through collaboration with diverse research partners

Benefits

  • health care
  • vision
  • dental
  • retirement
  • PTO
  • sick leave