Principal Data Engineer

TakedaBoston, MA
10h$139,000 - $236,800Remote

About The Position

Takeda Development Center Americas, Inc. is seeking a Principal Data Engineer with the following duties: engineer cloud-based data pipelines using Python, Spark, and Airflow to automate ETL/ELT processes, enabling efficient data ingestion, transformation, and storage across data lakes and warehouses; design and implement AI/ML and GenAI-driven solutions using supervised/unsupervised learning, statistical modeling, and NLP to enhance data quality, automate workflows, detect similarities, and support evidence-based clinical decision-making; develop robust data integration workflows for structured and unstructured data, ensuring adherence to Good Clinical Practices (GCP), FDA regulations, and SOPs through SQL-based data validation frameworks; create interactive dashboards and real-time visualization platforms to deliver actionable insights from clinical and operational data, enabling stakeholders to monitor performance and drive data-informed strategies; develop custom automation tools using Python, R, and APIs to streamline data entry, reduce manual processing, and enhance operational efficiency across clinical research systems; drive strategic alignment by partnering with crossfunctional teams, mentoring junior engineers, and advising leadership on AI/ML adoption, automation strategies, and emerging data technologies; influence industry practices by presenting technical innovations at leading conferences and guiding enterprise-wide adoption of scalable, AI-powered data engineering solutions. 100% remote work allowed anywhere in the U.S.

Requirements

  • Master’s degree in Computer Science, Data Science, Engineering, or related field, plus 30 months of related experience.
  • Prior experience must include: design, develop, test, and deploy software applications and features based on client and project requirements
  • Implement automated testing and regression testing using Selenium and Python to improve test coverage, reduce manual effort, and ensure application stability
  • Collaborate with cross-functional teams, including developers, business analysts, and QA leads, to identify test requirements and participate in Agile/Scrum ceremonies to plan, deliver, and communicate software progress iteratively
  • Perform data wrangling, transformation, and management to create structured datasets stored in databases, supporting data analyses.

Responsibilities

  • Engineer cloud-based data pipelines using Python, Spark, and Airflow to automate ETL/ELT processes, enabling efficient data ingestion, transformation, and storage across data lakes and warehouses
  • Design and implement AI/ML and GenAI-driven solutions using supervised/unsupervised learning, statistical modeling, and NLP to enhance data quality, automate workflows, detect similarities, and support evidence-based clinical decision-making
  • Develop robust data integration workflows for structured and unstructured data, ensuring adherence to Good Clinical Practices (GCP), FDA regulations, and SOPs through SQL-based data validation frameworks
  • Create interactive dashboards and real-time visualization platforms to deliver actionable insights from clinical and operational data, enabling stakeholders to monitor performance and drive data-informed strategies
  • Develop custom automation tools using Python, R, and APIs to streamline data entry, reduce manual processing, and enhance operational efficiency across clinical research systems
  • Drive strategic alignment by partnering with crossfunctional teams, mentoring junior engineers, and advising leadership on AI/ML adoption, automation strategies, and emerging data technologies
  • Influence industry practices by presenting technical innovations at leading conferences and guiding enterprise-wide adoption of scalable, AI-powered data engineering solutions

Benefits

  • U.S. based employees may be eligible for short-term and/or long-term incentives.
  • U.S. based employees may be eligible to participate in medical, dental, vision insurance, a 401(k) plan and company match, short-term and long-term disability coverage, basic life insurance, a tuition reimbursement program, paid volunteer time off, company holidays, and well-being benefits, among others.
  • U.S. based employees are also eligible to receive, per calendar year, up to 80 hours of sick time, and new hires are eligible to accrue up to 120 hours of paid vacation.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service