Data Engineer - 1010

LightcastMoscow, ID
10d

About The Position

The Data Engineer is responsible for building and maintaining systems that improve data reliability, quality, and accessibility. This role focuses on developing software and data pipelines that extract, transform, and load (ETL) large volumes of data from multiple sources into consistent, machine-readable formats. Data Engineers design and test architectures that support data extraction and transformation for predictive and prescriptive analytics. Working within an agile Scrum environment, the Data Engineer participates in two-week sprint cycles, contributing to sprint planning, development, and review processes to continuously improve data infrastructure and product capabilities.

Requirements

  • Bachelor’s in Computer Science, Mathematics, Statistics, or related field (or equivalent experience)
  • 2+ years in data engineering, data analysis, or similar roles
  • Experience with ETL, data warehousing, and scalable data systems.
  • Solid understanding of statistical methods and data analysis
  • Strong communication skills with ability to work independently and in teams
  • Proficiency in Python and SQL; familiarity with Java, C/C++, or similar
  • Strong software engineering, programming, and QA skills
  • Experience with Linux/Unix (POSIX systems) and Git

Nice To Haves

  • Experience with Docker, Terraform, and AWS (Batch, EMR/Spark, ecosystem tools)
  • Familiarity with GitLab and Scala

Responsibilities

  • Data Transformation: Collect, manage, and transform large-scale raw data from multiple sources into structured, reliable, and usable formats that meet business and analytical requirements.
  • Data Tool Creation: Design, develop, and maintain data architectures, pipelines, and applications using appropriate tools and frameworks to support scalable data processing and product development.
  • ETL Development: Build and maintain robust extract, transform, and load (ETL) processes that ensure data accuracy, efficiency, and availability across systems.
  • Root Cause Analysis & Continuous Improvement: Perform root cause analysis to identify data issues, system inefficiencies, or process gaps, and implement improvements that enhance data quality and operational performance.
  • Agile Collaboration: Work in an agile Scrum environment by contributing to sprint planning, development tasks, testing, and sprint reviews to deliver reliable and scalable data solutions
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service