Repository logo
 

mmCSM-NA: accurately predicting effects of single and multiple mutations on protein‒nucleic acid binding affinity

Accepted version
Peer-reviewed

No Thumbnail Available

Type

Article

Change log

Authors

Abstract

While protein–nucleic acid interactions are pivotal for many crucial biological processes, limited experimental data has made the development of computational approaches to characterise these interactions a challenge. Consequently, most approaches to understand the effects of missense mutations on protein-nucleic acid affinity have focused on single-point mutations and have presented a limited performance on independent data sets. To overcome this, we have curated the largest dataset of experimentally measured effects of mutations on nucleic acid binding affinity to date, encompassing 856 single-point mutations and 141 multiple-point mutations across 155 experimentally solved complexes. This was used in combination with an optimized version of our graph-based signatures to develop mmCSM-NA (http://biosig.unimelb.edu.au/mmcsm_na), the first scalable method capable of quantitatively and accurately predicting the effects of multiple-point mutations on nucleic acid binding affinities. mmCSM-NA obtained a Pearson’s correlation of up to 0.67 (RMSE of 1.06 Kcal/mol) on single-point mutations under cross-validation, and up to 0.65 on independent non-redundant datasets of multiple-point mutations (RMSE of 1.12 Kcal/mol), outperforming similar tools. mmCSM-NA is freely available as an easy-to-use web server and API. We believe it will be an invaluable tool to shed light on the role of mutations affecting protein–nucleic acid interactions in diseases.

Description

Keywords

Journal Title

NAR Genomics and Bioinformatics

Conference Name

Journal ISSN

2631-9268

Volume Title

Publisher

Oxford University Press

Publisher DOI

Publisher URL

Sponsorship
Alex G. C. de Sá is currently supported by the Joe White Bequest Fellowship. D.B.A. and D.E.V.P. were funded by a Newton Fund RCUK-CONFAP Grant awarded by The Medical Research Council (MR/M026302/1). D.B.A. was supported by the Jack Brockhoff Foundation (JBF 4186, 2016), Wellcome Trust (200814/Z/16/Z), and an Investigator Grant from the National Health and Medical Research Council (NHMRC) of Australia (GNT1174405). Supported in part by the Victorian Government’s Operational Infrastructure Support Program.