Junior Data Engineer

Ratio Therapeutics, Inc.Boston, MA
2d$65,000 - $75,000

About The Position

Ratio Therapeutics is seeking a highly motivated and detail-oriented Junior Data Engineer to join our dynamic Data team. This role will be pivotal in building, maintaining, and monitoring data pipelines that support our laboratory, R&D, and operational functions. The Junior Data Engineer will collaborate closely with scientists, engineers, and IT professionals to design and implement reliable data workflows, integrate data from diverse laboratory systems, and help establish best practices for data quality and governance.

Requirements

  • Bachelor’s degree in Computer Science, Data Science, Engineering, Information Systems, Bioinformatics, or a related technical field, or equivalent practical experience.
  • Foundational experience with programming for data work, preferably in Python or a similar language (through coursework, internships, or projects).
  • Experience developing web-based user interfaces (e.g. VueJS, React).
  • Understanding of ETL/ELT concepts and data pipelines, including working with structured data (e.g., CSV, relational databases) and semi-structured data (e.g., JSON).
  • Basic experience with SQL for querying and transforming data.
  • Familiarity with version control (e.g., Git) and collaborative software development practices is a plus.
  • Strong analytical and problem-solving skills, with an interest in debugging and improving data workflows.
  • Excellent communication and collaboration skills, with the ability to work effectively with both technical and non-technical stakeholders.

Nice To Haves

  • Exposure to data integration in a lab, biotech, or healthcare environment (e.g., ELN, LIMS, instrument data) is a plus but not required.
  • Exposure to cloud platforms (e.g., AWS, Azure, GCP) or data engineering tools (e.g., Airflow, dbt, Spark) is a plus.

Responsibilities

  • Design, implement, and maintain data pipelines that move and transform data from laboratory instruments, ELNs, LIMS, and other source systems into our centralized data platforms.
  • Work closely with stakeholders to understand data requirements, data models, and workflows, and translate these into robust, well-documented data engineering solutions.
  • Develop and maintain reusable scripts and automation (e.g., for file ingestion, API integrations, data transformations, and validation checks) using modern programming languages and tools (e.g., Python).
  • Monitor data pipelines and jobs, investigate failures or performance issues, and help improve pipeline reliability, observability, and scalability over time.
  • Implement data quality checks and validation rules to ensure completeness, consistency, and accuracy of laboratory and study data.
  • Collaborate with the data team to deploy, version, and maintain data workflows in shared environments (e.g., dev/test/prod), following best practices for code review and change management.
  • Contribute to documentation and standards for data schemas, pipeline designs, and data governance, helping to establish and refine best practices for the Data team.
  • Support end users (scientists, analysts, and other stakeholders) by troubleshooting data-related issues and building small utilities or tools that make data easier to access and use.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service