Repository logo
 

MolScore: a scoring, evaluation and benchmarking framework for generative models in de novo drug design

Published version
Peer-reviewed

Repository DOI


Change log

Authors

Thomas, Morgan 
O'Boyle, Noel M 
Bender, Andreas 
De Graaf, Chris 

Abstract

Generative models are undergoing rapid research and application to de novo drug design. To facilitate their application and evaluation, we present MolScore. MolScore already contains many drug-design-relevant scoring functions commonly used in benchmarks such as, molecular similarity, molecular docking, predictive models, synthesizability, and more. In addition, providing performance metrics to evaluate generative model performance based on the chemistry generated. With this unification of functionality, MolScore re-implements commonly used benchmarks in the field (such as GuacaMol, MOSES, and MolOpt). Moreover, new benchmarks can be created trivially. We demonstrate this by testing a chemical language model with reinforcement learning on three new tasks of increasing complexity related to the design of 5-HT2a ligands that utilise either molecular descriptors, 266 pre-trained QSAR models, or dual molecular docking. Lastly, MolScore can be integrated into an existing Python script with just three lines of code. This framework is a step towards unifying generative model application and evaluation as applied to drug design for both practitioners and researchers. The framework can be found on GitHub and downloaded directly from the Python Package Index.

Description

Keywords

Benchmarking, De novo molecule generation, Drug design, Generative model, Scoring functions

Journal Title

Journal of Cheminformatics

Conference Name

Journal ISSN

1758-2946
1758-2946

Volume Title

16

Publisher

BMC