GalaxEsystems-posted 11 days ago
Full-time • Senior
Remote • Plano, TX
1,001-5,000 employees

We are seeking a skilled Data Engineer to design, build, and maintain scalable data pipelines and analytics solutions. The ideal candidate will work closely with cross-functional teams to ensure efficient data availability, accuracy, and performance across the organization.

  • Develop and manage end-to-end ETL pipelines using PySpark and SQL.
  • Build and optimize data workflows on Azure Databricks for large-scale data processing.
  • Automate workflow orchestration and scheduling using Apache Airflow.
  • Ensure data quality, reliability, and integrity across multiple data sources.
  • Collaborate with Data Scientists, Analysts, and Architects to support business intelligence and analytics initiatives.
  • Strong hands-on experience with Azure Databricks and PySpark.
  • Expertise in SQL for data transformation, performance tuning, and querying.
  • Proven experience in ETL development and data pipeline optimization.
  • Practical knowledge of Apache Airflow for scheduling and orchestration.
  • Understanding of cloud data architectures, preferably Microsoft Azur
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service