At Apheris, we are building the future of how AI is applied in pharmaceutical R&D. We enable leading pharmaceutical teams to discover and develop drugs faster. We host the industry’s largest federated data networks for drug discovery AI, spanning co-folding, ADMET, and antibody developability. Across these networks, models are trained on proprietary industry datasets to achieve higher performance and broader applicability while keeping data control and IP protected. We deliver these superior models through drug discovery applications that enable teams to run them at scale, further customize them, and integrate them into existing R&D workflows. AI Structural Biology (AISB) Network: Pharmaceutical companies collaborate in the field of co-folding, structure-based binding affinity predictions and antibody design. ADMET Network: Pharmaceutical and biotech companies collaborate to improve small-molecule property prediction and expand in to further drug modalities. Antibody Developability Network: Pharma partners collaborate to federate historical and purpose-built antibody developability datasets for secure ML training, without data leaving each partner’s environment. About the role We are looking for a Forward-Deployed Cheminformatician to own how binding data is prepared across our co-folding focused networks and initiatives. Binding data is the input that decides whether our co-folding and binding-affinity models perform in real drug programs. It arrives from pharma partners in heterogeneous shapes — different assay registries, different metadata, different chemical-representation standards, different choices on qualifiers, replicates and censoring. We need someone who turns this into a repeatable, well-documented preparation pipeline that pharma representatives can run alongside us, and that scales to the public-data corpus we build for our own model training. This is half engineering, half forward-deployed work. You will define the protocol, harden it with validators and scripts, integrate it into the Apheris products, run it with each new partner, and own the equivalent pipeline for the public binding-data corpus.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level