Data Architect - Databricks

SteampunkMcLean, VA
1d$125,000 - $175,000

About The Position

We are looking for a seasoned Data Architect to work with our team and our clients to develop enterprise grade data platforms, services, and pipelines. We are looking for someone that has experience architecting entire data ecosystems levering multiple concepts and constructs such as data marts, data warehouses, data lakes, data lake houses, data mesh, and data fabric. We’re looking for more than just a "Data Architect", but a technologist with excellent communication and customer service skills and a passion for data and problem solving.

Requirements

  • 12 years of experience with a Bachelors Degree OR 9 years of experience with a Masters Degree
  • Ability to hold a position of public trust with the US government.
  • 5+ years direct experience in with architecting complex enterprise level Data Ecosystems using tools such as: Big data tools: Databricks, Apache Spark, Delta Lake, etc. SQL and NoSQL databases Data pipeline and workflow management tools: Databricks Workflows, Airflow, Step Functions, etc. AWS cloud services: Databricks on AWS, S3, EC2, RDS (or Azure equivalents). Object-oriented/object function scripting languages: PySpark/Python, Java, C++, Scala, etc.
  • Experience working with Data Lakehouse architecture and Delta Lake/Apache Iceberg.
  • Advanced working SQL knowledge and experience working with relational databases, query authoring and optimization (SQL) as well as working familiarity with a variety of databases.
  • Experience manipulating, processing, and extracting value from large, disconnected datasets.
  • Ability to inspect existing data pipelines, discern their purpose and functionality, and re-implement them efficiently in Databricks.
  • Experience manipulating structured and unstructured data.
  • Experience architecting data systems (marts, warehouses, lakes, lake houses, etc.)
  • Experience with data cataloging tools such as Collibra, Unity Catalog, and Alation,
  • Experience working in an Agile environment.

Responsibilities

  • Lead and architect a new Enterprise Data Ecosystem that will support the creation and consumption of governed and standardized data products, and making them available in a variety of formats using role based access controls.
  • Build the structure and governance for developing a data product catalog, defining what constitutes a data product and defining “the checklist” that’s required for inclusion.
  • Lead diverse teams comprised of roles such as data analysts, data engineers, data governance SMEs, BI / Visualization SMEs, and data scientists
  • Address technical inquiries concerning customization, integration, enterprise architecture and general feature/functionality of data products
  • Support an Agile software development lifecycle
  • You will contribute to the growth of our AI & Data Exploitation Practice!
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service