At Lilly, we unite caring with discovery to make life better for people around the world. We are a global healthcare leader headquartered in Indianapolis, Indiana. Our employees around the world work to discover and bring life-changing medicines to those who need them, improve the understanding and management of disease, and give back to our communities through philanthropy and volunteerism. We give our best effort to our work, and we put people first. We’re looking for people who are determined to make life better for people around the world. Purpose Lilly TuneLab is an AI-powered drug discovery platform that provides biotech companies with access to machine learning models trained on Lilly's extensive proprietary pharmaceutical research data. Through federated learning, the platform enables Lilly to build models on broad, diverse datasets from across the biotech ecosystem while preserving partner data privacy and competitive advantages. This collaborative approach accelerates drug discovery by creating continuously improving AI models that benefit both Lilly and our biotech partners. The Machine Learning Scientist/Sr Scientist, Federated Benchmarking & Validation Engineering plays an essential role within the TuneLab platform, responsible for identifying, assessing, and implementing cutting-edge algorithmic solutions that leverage diverse datasets while ensuring data privacy and security for our biotech partners. This position requires comprehensive knowledge in small molecule drug development, ADME/Tox, antibody engineering, and/or genetic medicine, combined with expertise in data science and statistical analysis to develop sophisticated models utilizing federated learning. This position will be instrumental in advancing both Lilly's pipeline and our partners' drug discovery efforts by designing critical algorithms and workflows that expedite the creation of transformative therapies. This role centers on constructing robust validation frameworks for federated models, creating privacy-preserving test sets across partner datasets, establishing standardized benchmarks against public datasets, and ensuring model reproducibility and generalization in diverse deployment scenarios.