Repository logo

Prediction of gene expression in embryonic structures of Drosophila melanogaster.

Published version



Change log


Samsonova, Anastasia A 
Niranjan, Mahesan 
Russell, Steven 
Brazma, Alvis 


Understanding how sets of genes are coordinately regulated in space and time to generate the diversity of cell types that characterise complex metazoans is a major challenge in modern biology. The use of high-throughput approaches, such as large-scale in situ hybridisation and genome-wide expression profiling via DNA microarrays, is beginning to provide insights into the complexities of development. However, in many organisms the collection and annotation of comprehensive in situ localisation data is a difficult and time-consuming task. Here, we present a widely applicable computational approach, integrating developmental time-course microarray data with annotated in situ hybridisation studies, that facilitates the de novo prediction of tissue-specific expression for genes that have no in vivo gene expression localisation data available. Using a classification approach, trained with data from microarray and in situ hybridisation studies of gene expression during Drosophila embryonic development, we made a set of predictions on the tissue-specific expression of Drosophila genes that have not been systematically characterised by in situ hybridisation experiments. The reliability of our predictions is confirmed by literature-derived annotations in FlyBase, by overrepresentation of Gene Ontology biological process annotations, and, in a selected set, by detailed gene-specific studies from the literature. Our novel organism-independent method will be of considerable utility in enriching the annotation of gene function and expression in complex multicellular organisms.



Algorithms, Animals, Artificial Intelligence, Cluster Analysis, Computational Biology, Databases, Genetic, Drosophila melanogaster, Embryo, Nonmammalian, Gene Expression, Gene Expression Profiling, Gene Regulatory Networks, Genes, Developmental, Genes, Insect, In Situ Hybridization, Nucleic Acid Hybridization, Oligonucleotide Array Sequence Analysis

Journal Title

PLoS Comput Biol

Conference Name

Journal ISSN


Volume Title



Public Library of Science (PLoS)
Medical Research Council (G8225539)
BBSRC (G18877)