Takeda Development Center Americas, Inc. is seeking a Principal Data Engineer with the following duties: engineer cloud-based data pipelines using Python, Spark, and Airflow to automate ETL/ELT processes, enabling efficient data ingestion, transformation, and storage across data lakes and warehouses; design and implement AI/ML and GenAI-driven solutions using supervised/unsupervised learning, statistical modeling, and NLP to enhance data quality, automate workflows, detect similarities, and support evidence-based clinical decision-making; develop robust data integration workflows for structured and unstructured data, ensuring adherence to Good Clinical Practices (GCP), FDA regulations, and SOPs through SQL-based data validation frameworks; create interactive dashboards and real-time visualization platforms to deliver actionable insights from clinical and operational data, enabling stakeholders to monitor performance and drive data-informed strategies; develop custom automation tools using Python, R, and APIs to streamline data entry, reduce manual processing, and enhance operational efficiency across clinical research systems; drive strategic alignment by partnering with crossfunctional teams, mentoring junior engineers, and advising leadership on AI/ML adoption, automation strategies, and emerging data technologies; influence industry practices by presenting technical innovations at leading conferences and guiding enterprise-wide adoption of scalable, AI-powered data engineering solutions. 100% remote work allowed anywhere in the U.S.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Principal