Repository logo
 

Generalized empirical Bayesian methods for discovery of differential data in high-throughput biology.


Type

Article

Change log

Authors

Hardcastle, Thomas J 

Abstract

MOTIVATION: High-throughput data are now commonplace in biological research. Rapidly changing technologies and application mean that novel methods for detecting differential behaviour that account for a 'large P, small n' setting are required at an increasing rate. The development of such methods is, in general, being done on an ad hoc basis, requiring further development cycles and a lack of standardization between analyses. RESULTS: We present here a generalized method for identifying differential behaviour within high-throughput biological data through empirical Bayesian methods. This approach is based on our baySeq algorithm for identification of differential expression in RNA-seq data based on a negative binomial distribution, and in paired data based on a beta-binomial distribution. Here we show how the same empirical Bayesian approach can be applied to any parametric distribution, removing the need for lengthy development of novel methods for differently distributed data. Comparisons with existing methods developed to address specific problems in high-throughput biological data show that these generic methods can achieve equivalent or better performance. A number of enhancements to the basic algorithm are also presented to increase flexibility and reduce computational costs. AVAILABILITY AND IMPLEMENTATION: The methods are implemented in the R baySeq (v2) package, available on Bioconductor http://www.bioconductor.org/packages/release/bioc/html/baySeq.html. CONTACT: tjh48@cam.ac.uk SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Description

Keywords

Algorithms, Animals, Bayes Theorem, Binomial Distribution, Gene Expression Profiling, Rats, Sequence Analysis, RNA, Software

Journal Title

Bioinformatics

Conference Name

Journal ISSN

1367-4803
1367-4811

Volume Title

32

Publisher

Oxford University Press (OUP)
Sponsorship
European Research Council (340642)
This work was supported by European Research Council Advanced Investigator Grant ERC-2013-AdG 340642 – TRIBE.