Data Engineer

Base Operations
67d

About The Position

Base Operations is building the world’s largest dataset of global threat patterns and street-level intelligence. We are looking for an exceptional data engineer that can define, build, and mature the data pipelines, models, and warehouse to help us excel in this objective. This individual should have the technical acumen to contribute thought leadership relative to our architecture strategy, while also excelling at mapping that strategy into a tactical plan and executing against it.

Requirements

  • 3+ years experience building production-level data pipelines and standing up the platforms to support them.
  • 3+ experience implementing and maintaining data warehouses, data models, and performing schema migrations. Experience standing up a data warehouse from scratch as well as AWS tools such as Athena and S3 a big plus.
  • Strong working knowledge of SQL, data models, and performing schema migrations.
  • Demonstrable thought leadership on operationalizing data quality.
  • Development experience delivering production ready code in Python. Familiarity with Pandas framework.
  • Strong collaborator across functional areas, incl. data science, infrastructure, software development, and product.

Nice To Haves

  • Hands on knowledge of GIS enabled data stores such as Postgres/GIS, GIS-related libraries such as GeoPandas, and analytic data platforms such as DataBricks.
  • Experience with infrastructure and automation tools such as Jenkins, Argo Workflows, Argo CD, Terraform, Kubernetes or equivalent.
  • Deep understanding of ML and AI based infrastructure.
  • Familiarity with JavaScript.

Responsibilities

  • Build, test, and maintain robust ingestion and transformation pipelines, incl. the injection of NLP models to extract and augment data from free text sources.
  • Develop data transformation, validation and analysis methods to augment the utility and actionability of data.
  • Build out a data warehousing strategy to realize and enhance the full data lifecycle.
  • Operationalize data quality throughout, ensuring visibility and timely remediation of data quality issues.
  • Contribute to GIS data architecture strategies, and drive transformation and implementation activities resulting from those strategies.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service