Data Engineer

The Friedkin Group, Houston, TX
10d

About The Position

This Data Engineer supports the development and operation of data ingestion pipelines and analytics solutions using Databricks for multi-disciplinary departments across the organization. This role is designed for data engineers interested in modern data engineering, data modeling, and Databricks application development while learning enterprise standards for reliability, security, and governance.

Requirements

  • Bachelor's Degree in Data Engineering, Computer Science, MIS, or other related discipline.
  • 3-4 years of experience, including internships or academic projects (Required)
  • Familiarity with Databricks, Apache Spark, or PySpark (Required)
  • Understanding of ETL/ELT, schemas, and data quality concepts (Required)
  • Experience using Delta Lake
  • Experience developing metadata-driven ETL processes
  • Experience creating analytical data models
  • Experience developing Databricks applications or dashboards
  • Proficiency in data modeling and database management.
  • Familiarity with data warehousing and cloud platforms (e.g., AWS, Azure, or Google Cloud).
  • Strong analytic skills related to working with unstructured datasets.
  • Working knowledge of message queuing, stream processing, and highly scalable ‘big data’ data stores.
  • Strong project management and organizational skills.
  • Ability to collaborate effectively with cross-functional teams and learn in a fast-paced environment.
  • Excellent written and verbal communication skills.
  • Exceptional organizational and problem-solving skills
  • Strong business acumen
  • Skill in collecting and analyzing complex data.
  • Knowledge of various operating systems and platforms.
  • Problem-solving and analytical skills.

Nice To Haves

  • Working knowledge of Python and Spark SQL
  • Exposure to cloud platforms such as AWS, Azure, or GCP

Responsibilities

  • Build and maintain Databricks-based ingestion pipelines using Delta Lake
  • Support batch and incremental ingestion workflows
  • Ingest data from databases, flat files, and APIs
  • Develop metadata-driven ETL processes
  • Create and maintain data models for analytics and reporting
  • Develop Databricks applications, notebooks, and dashboards
  • Write documented code aligned with engineering standards
  • Participate in code reviews, testing, and deployment
  • Monitor pipelines and troubleshoot failures
  • Collaborate with multi-disciplinary business units to support data use cases
  • Document data pipelines, models, and operational procedures

Benefits

  • Career Growth: Advance your career with opportunities for leadership and personal development.
  • Culture of Excellence: Be part of a supportive team that values your input and encourages innovation.
  • Competitive Benefits: Enjoy a comprehensive benefits package that looks after both your professional and personal needs.
  • Medical, dental, and vision insurance
  • Wellness programs
  • Retirement plans
  • Generous paid leave