Junior Data Engineer

Sembcorp IndustriesCentral, LA
29d

About The Position

Sembcorp is a leading energy and urban solutions provider headquartered in Singapore. Led by its purpose to drive energy transition, Sembcorp delivers sustainable energy solutions and urban developments by leveraging its sector expertise and global track record. Join us in shaping a sustainable energy future Drive Asia's energy transition with us! Our Gas & Related Services segment is a key growth engine, delivering reliable and efficient energy to industries and communities across multiple countries. We support Asia's growing energy needs while advancing the shift to a lower-carbon future.

Requirements

  • Degree in Computer Science, Data Science or related field
  • 2 years in Azure Data Engineering, Python, PySpark, or Big Data development and 1 year experience using Azure IOT hubs, Azure stream analytics working with Realtime streaming data
  • Familiarity with Azure Synapse Analytics, Azure batch,
  • Strong knowledge of SQL, Data Warehousing, Data Marts, and data ingestion using PySpark and Python.
  • Hands-on experience in developing and maintaining ETL pipelines on cloud platforms (Azure preferred)
  • Familiarity with Microsoft Fabric, Databricks, Synapse and datalake
  • Understanding of DevOps practices for deployment and automation (Azure DevOps preferred).
  • Strong problem-solving, communication, and interpersonal skills.
  • Ability to work collaboratively in a team environment.

Responsibilities

  • Design, develop, and maintain data pipelines for ingestion, transformation, and storage across multiple environments with tools like Pyspark, Azure fabric or databaricks notebooks
  • Design and development of real-time streaming data pipelines using IOT and Stream analytics or Apache Kafka
  • Deploy pipeline artifacts from one environment to another using Azure DevOps and ensure smooth CI/CD processes.
  • Collaborate with data architects and analysts to implement scalable data solutions.
  • Monitor and optimize ETL workflows for performance and reliability.
  • Ensure data quality, security, and compliance across all stages of the data lifecycle.
  • Support troubleshooting and resolution of data pipeline issues in production environments.
  • Work closely with cross-functional teams to deliver data for analytics and reporting needs.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service