Repository logo
 

Probabilistic machine learning algorithms for molecule discovery


Loading...
Thumbnail Image

Type

Change log

Authors

Abstract

Discovering new molecules empowers humanity to solve problems in health, agriculture, energy, and more. The key challenge of molecule discovery is that the space of all possible molecules is vastly larger than the amount of molecules which we can test experimentally with our limited resources. Given this challenge, arguably the best option is to judiciously select which molecules to test based on both our current knowledge and our expectation of what information will be gained from each test.

In machine learning, this approach is typically called Bayesian optimisation and has been studied for many other problems, such as tuning hyperparameters of machine learning models. Although in principle Bayesian optimisation can be straightforwardly applied to the problem of discovering new molecules, the discrete nature of molecules means that new models and algorithms are needed to make Bayesian optimisation work in practice.

This thesis presents various probabilistic machine learning algorithms which could be used within a Bayesian optimisation loop to discover new molecules. Latent space optimisation with weighted retraining (chapter 3) and adaptive deep kernel fitting with implicit function theorem (chapter 4) are both algorithms which use a Gaussian process with a deep neural network kernel function to model the relationship between molecular structure and some property of interest. Tanimoto random features (chapter 5) allows an established cheminformatics model to be applied (approximately) to large datasets. Finally, retro-fallback (chapter 6) uses a novel probabilistic formulation of the retrosynthesis problem to estimate whether a molecule can be synthesized, and thereby determine whether it should be considered by Bayesian optimisation. Together, these algorithms form a suite of tools which could be used to discover new molecules automatically and intelligently.

Description

Date

2024-04-20

Advisors

Hernández-Lobato, José Miguel

Qualification

Doctor of Philosophy (PhD)

Awarding Institution

University of Cambridge

Rights and licensing

Except where otherwised noted, this item's license is described as Attribution 4.0 International (CC BY 4.0)