MolE8: finding DFT potential energy surface minima values from force-field optimised organic molecules with new machine learning representations.
Royal Society of Chemistry (RSC)
MetadataShow full item record
Lee, S., Ermanis, K., & Goodman, J. (2022). MolE8: finding DFT potential energy surface minima values from force-field optimised organic molecules with new machine learning representations.. Chem Sci https://doi.org/10.1039/d1sc06324c
The use of machine learning techniques in computational chemistry has gained significant momentum since large molecular databases are now readily available. Predictions of molecular properties using machine learning have advantages over the traditional quantum mechanics calculations because they can be cheaper computationally without losing the accuracy. We present a new extrapolatable and explainable molecular representation based on bonds, angles and dihedrals that can be used to train machine learning models. The trained models can accurately predict the electronic energy and the free energy of small organic molecules with atom types C, H N and O, with a mean absolute error of 1.2 kcal mol-1. The models can be extrapolated to larger organic molecules with an average error of less than 3.7 kcal mol-1 for 10 or fewer heavy atoms, which represent a chemical space two orders of magnitude larger. The rapid energy predictions of multiple molecules, up to 7 times faster than previous ML models of similar accuracy, has been achieved by sampling geometries around the potential energy surface minima. Therefore, the input geometries do not have to be located precisely on the minima and we show that accurate density functional theory energy predictions can be made from force-field optimised geometries with a mean absolute error 2.5 kcal mol-1.
Is supplemented by: https://doi.org/10.17863/CAM.78009
Isaac Newton Trust (17.08(D))
Leverhulme Trust (ECF-2017-255)
Engineering and Physical Sciences Research Council (EP/P020259/1)
External DOI: https://doi.org/10.1039/d1sc06324c
This record's URL: https://www.repository.cam.ac.uk/handle/1810/337621
Attribution 4.0 International
Licence URL: https://creativecommons.org/licenses/by/4.0/