Repository logo
 

An empirical Bayes approach to network recovery using external knowledge.

Accepted version
Peer-reviewed

Type

Article

Change log

Authors

Kpogbezan, Gino B 
van der Vaart, Aad W 
van Wieringen, Wessel N 
Leday, Gwenaël GR 
van de Wiel, Mark A 

Abstract

Reconstruction of a high-dimensional network may benefit substantially from the inclusion of prior knowledge on the network topology. In the case of gene interaction networks such knowledge may come for instance from pathway repositories like KEGG, or be inferred from data of a pilot study. The Bayesian framework provides a natural means of including such prior knowledge. Based on a Bayesian Simultaneous Equation Model, we develop an appealing Empirical Bayes (EB) procedure that automatically assesses the agreement of the used prior knowledge with the data at hand. We use variational Bayes method for posterior densities approximation and compare its accuracy with that of Gibbs sampling strategy. Our method is computationally fast, and can outperform known competitors. In a simulation study, we show that accurate prior data can greatly improve the reconstruction of the network, but need not harm the reconstruction if wrong. We demonstrate the benefits of the method in an analysis of gene expression data from GEO. In particular, the edges of the recovered network have superior reproducibility (compared to that of competitors) over resampled versions of the data.

Description

Keywords

Empirical Bayes, High-dimensional Bayesian inference, Prior information, Undirected network, Variational approximation, Bayes Theorem, Biometry, Computer Simulation, Gene Regulatory Networks, Models, Statistical, Pilot Projects, Reproducibility of Results

Journal Title

Biom J

Conference Name

Journal ISSN

0323-3847
1521-4036

Volume Title

59

Publisher

Wiley
Sponsorship
The research leading to these results has received funding from the European Research Council under ERC Grant Agreement 320637.