Measuring similarity between gene expression profiles: a Bayesian approach.

Nguyen, Viet-Anh; Lió, Pietro

Repository URI

http://www.dspace.cam.ac.uk/handle/1810/237888

Files

1471-2164-10-S3-S14.xml (67.75 KB)

1471-2164-10-S3-S14.pdf (558.15 KB)

Type

Conference Object

Authors

Nguyen, Viet-Anh

Lió, Pietro

Abstract

BACKGROUND: Grouping genes into clusters on the basis of similarity between their expression profiles has been the main approach to predicting functional modules, from which important inferences can be drawn or decisions about further investigation made. While an unambiguous choice of similarity metric is important, current practice typically relies on Euclidean distance and Pearson correlation, whose assumptions are unlikely to hold for high-throughput microarray data. RESULTS: We advocate the use of a novel metric, BayesGen, to measure similarity between gene expression profiles, and demonstrate its performance on two important applications: constructing a genome-wide co-expression network, and clustering human cancer tissues into subtypes. BayesGen is formulated as the evidence ratio between two alternative hypotheses about the generating mechanism of a given pair of genes, and incorporates the global characteristics of the whole dataset as prior knowledge. Through the joint modelling of expected intensity levels and noise variances, it addresses the inherent nonlinearity and the association of noise levels across different microarray value ranges. The full Bayesian formulation also facilitates meta-analysis. CONCLUSION: BayesGen allows more effective extraction of similarity information between genes from microarray expression data, which has a significant effect on various inference tasks. It also provides a robust choice for other object-feature data, as illustrated by the results of the test on synthetic data.

Keywords

Bayes Theorem, Biometry, Gene Expression Profiling, Gene Expression Regulation, Neoplastic, Humans, Neoplasms

Journal Title

BMC Genomics

Journal ISSN

1471-2164

Publisher

Springer Science and Business Media LLC

Publisher DOI

https://doi.org/10.1186/1471-2164-10-S3-S14

Rights

http://www.rioxx.net/licenses/all-rights-reserved

Collections

Scholarly Works - Computer Science and Technology
Symplectic mapped items for data match