This position focuses on data pipelines and workflows. The engineer will be embedded in a cross-functional Agile team to design, build, troubleshoot, and maintain ETL/ELT workflows that support application functionality, analytics, reporting, and scientific workflows. The engineer will develop and manage data pipelines using Apache Airflow, ensuring reliable orchestration, scheduling, monitoring, and recovery.
Responsibilities include extracting, transforming, validating, and loading data across systems; writing and optimizing SQL queries; troubleshooting data quality issues and pipeline bottlenecks; and supporting database exploration. The role also involves evaluating and adopting new data tools and technologies, supporting integration between data pipelines and applications, assisting with schema evolution and data modeling, and documenting pipeline logic. Improving data engineering standards, observability, testing practices, and operational reliability is also part of the role.
Collaboration with stakeholders, including software developers, scientists, and engineers, is key to understanding data sources, workflow requirements, and downstream data needs. Regular interaction with scientists and engineers to understand research and technical workflows is expected, and experience in scientific or research environments is a strong plus.
Job Type
Full-time
Career Level
Entry Level