Towards quantifying the uncertainty in in silico predictions using Bayesian learning
View / Open Files
Authors
Allen, TEH
Middleton, AM
Goodman, JM
Russell, PJ
Kukic, P
Gutsell, S
Publication Date
2022-08Journal Title
Computational Toxicology
ISSN
2468-1113
Publisher
Elsevier BV
Number
100228
Pages
100228-100228
Type
Article
This Version
AM
Metadata
Show full item recordCitation
Allen, T., Middleton, A., Goodman, J., Russell, P., Kukic, P., & Gutsell, S. (2022). Towards quantifying the uncertainty in in silico predictions using Bayesian learning. Computational Toxicology, (100228), 100228-100228. https://doi.org/10.1016/j.comtox.2022.100228
Abstract
Next-generation risk assessment (NGRA) involves the combination of in vitro and in silico models for more human-relevant, ethical, and sustainable human chemical safety assessment. NGRA requires a quantitative mechanistic understanding of the effects of chemicals across human biology (be they molecular, cellular, organ-level or higher) coupled with a quantitative understanding of the uncertainty in any experimentally measured or predicted values. These values with their uncertainties can then be considered as a probability distribution, which can then be compared to exposure estimates to establish the presence or absence of a margin of safety. We have constructed Bayesian learning neural networks to provide such quantitative predictions and uncertainties for 20 pharmacologically important human molecular initiating events. These models produce high quality quantitative estimates (p(IC50), p(EC50), p(Ki), p(Kd)) of biochemical activity at a molecular initiating event (MIE) with average mean absolute errors (in Log units) of 0.625 ± 0.048 in test data and 0.941 ± 0.215 in external validation data. The key advantage of these models is their ability to also produce standard deviations and credible intervals (CIs) to quantify the uncertainty in these predictions, which we show to be able to distinguish between molecules close to the training data in chemical structure, those less similar to the training data, and decoy compounds drawn from the wider ChEMBL database. These uncertainty values mean that when a prediction is made a user can understand the certainty of the prediction, similar to a quantitative applicability domain, aiding prediction usefulness in NGRA. The ability for in silico methods to produce quantitative predictions with these kinds of probability distributions will be vital to their further use in NGRA, and here clear first steps have been taken.
Embargo Lift Date
2023-04-30
Identifiers
External DOI: https://doi.org/10.1016/j.comtox.2022.100228
This record's URL: https://www.repository.cam.ac.uk/handle/1810/336947
Rights
Attribution-NonCommercial-NoDerivatives 4.0 International
Licence URL: https://creativecommons.org/licenses/by-nc-nd/4.0/
Statistics
Total file downloads (since January 2020). For more information on metrics see the
IRUS guide.
Recommended or similar items
The current recommendation prototype on the Apollo Repository will be turned off on 03 February 2023. Although the pilot has been fruitful for both parties, the service provider IKVA is focusing on horizon scanning products and so the recommender service can no longer be supported. We recognise the importance of recommender services in supporting research discovery and are evaluating offerings from other service providers. If you would like to offer feedback on this decision please contact us on: support@repository.cam.ac.uk