Modeling mutational effects on biochemical phenotypes using convolutional neural networks: application to SARS-CoV-2.
View / Open Files
Publication Date
2022-07-15Journal Title
iScience
ISSN
2589-0042
Publisher
Elsevier BV
Volume
25
Issue
7
Language
eng
Type
Article
This Version
VoR
Metadata
Show full item recordCitation
Wang, B., & Gamazon, E. R. (2022). Modeling mutational effects on biochemical phenotypes using convolutional neural networks: application to SARS-CoV-2.. iScience, 25 (7) https://doi.org/10.1016/j.isci.2022.104500
Abstract
Deep mutational scanning (DMS) experiments have been performed on SARS-CoV-2's spike receptor-binding domain (RBD) and human angiotensin-converting enzyme 2 (ACE2) zinc-binding peptidase domain-both central players in viral infection and evolution and antibody evasion-quantifying how mutations impact biochemical phenotypes. We modeled biochemical phenotypes from massively parallel assays, using neural networks trained on protein sequence mutations in the virus and human host. Neural networks were significantly predictive of binding affinity, protein expression, and antibody escape, learning complex interactions and higher-order features that are difficult to capture with conventional methods from structural biology. Integrating the physicochemical properties of amino acids, such as hydrophobicity and long-range non-bonded energy per atom, significantly improved prediction (empirical p < 0.01). We observed concordance of the neural network predictions with molecular dynamics (multiple 500 ns or 1 μs all-atom) simulations of the spike protein-ACE2 interface, with critical implications for the use of deep learning to dissect molecular mechanisms.
Keywords
Health Sciences, Computational Intelligence, Computational Molecular Modeling
Identifiers
PMC9159778, 35669036
External DOI: https://doi.org/10.1016/j.isci.2022.104500
This record's URL: https://www.repository.cam.ac.uk/handle/1810/338899
Rights
Attribution-NonCommercial-NoDerivatives 4.0 International
Licence URL: https://creativecommons.org/licenses/by-nc-nd/4.0/
Statistics
Total file downloads (since January 2020). For more information on metrics see the
IRUS guide.