Bayesian networks elucidate complex genomic landscapes in cancer.
View / Open Files
Authors
Chatzipli, Aikaterini
Publication Date
2022-04-04Journal Title
Commun Biol
ISSN
2399-3642
Publisher
Springer Science and Business Media LLC
Volume
5
Issue
1
Number
ARTN 306
Pages
306
Type
Article
This Version
VoR
Physical Medium
Electronic
Metadata
Show full item recordCitation
Angelopoulos, N., Chatzipli, A., Nangalia, J., Maura, F., & Campbell, P. J. (2022). Bayesian networks elucidate complex genomic landscapes in cancer.. Commun Biol, 5 (1. ARTN 306), 306. https://doi.org/10.1038/s42003-022-03243-w
Abstract
Bayesian networks (BNs) are disciplined, explainable Artificial Intelligence models that can describe structured joint probability spaces. In the context of understanding complex relations between a number of variables in biological settings, they can be constructed from observed data and can provide a guiding, graphical tool in exploring such relations. Here we propose BNs for elucidating the relations between driver events in large cancer genomic datasets. We present a methodology that is specifically tailored to biologists and clinicians as they are the main producers of such datasets. We achieve this by using an optimal BN learning algorithm based on well established likelihood functions and by utilising just two tuning parameters, both of which are easy to set and have intuitive readings. To enhance value to clinicians, we introduce (a) the use of heatmaps for families in each network, and (b) visualising pairwise co-occurrence statistics on the network. For binary data, an optional step of fitting logic gates can be employed. We show how our methodology enhances pairwise testing and how biologists and clinicians can use BNs for discussing the main relations among driver events in large genomic cohorts. We demonstrate the utility of our methodology by applying it to 5 cancer datasets revealing complex genomic landscapes. Our networks identify central patterns in all datasets including a central 4-way mutual exclusivity between HDR, t(4,14), t(11,14) and t(14,16) in myeloma, and a 3-way mutual exclusivity of three major players: CALR, JAK2 and MPL, in myeloproliferative neoplasms. These analyses demonstrate that our methodology can play a central role in the study of large genomic cancer datasets.
Keywords
Algorithms, Artificial Intelligence, Bayes Theorem, Genomics, Humans, Neoplasms
Identifiers
External DOI: https://doi.org/10.1038/s42003-022-03243-w
This record's URL: https://www.repository.cam.ac.uk/handle/1810/337579
Statistics
Total file downloads (since January 2020). For more information on metrics see the
IRUS guide.
Recommended or similar items
The current recommendation prototype on the Apollo Repository will be turned off on 03 February 2023. Although the pilot has been fruitful for both parties, the service provider IKVA is focusing on horizon scanning products and so the recommender service can no longer be supported. We recognise the importance of recommender services in supporting research discovery and are evaluating offerings from other service providers. If you would like to offer feedback on this decision please contact us on: support@repository.cam.ac.uk