A guide to best practices for Gene Ontology (GO) manual annotation.


Type
Article
Change log
Authors
Balakrishnan, Rama 
Harris, Midori A 
Huntley, Rachael 
Van Auken, Kimberly 
Cherry, J Michael 
Abstract

The Gene Ontology Consortium (GOC) is a community-based bioinformatics project that classifies gene product function through the use of structured controlled vocabularies. A fundamental application of the Gene Ontology (GO) is in the creation of gene product annotations, evidence-based associations between GO definitions and experimental or sequence-based analysis. Currently, the GOC disseminates 126 million annotations covering >374,000 species including all the kingdoms of life. This number includes two classes of GO annotations: those created manually by experienced biocurators reviewing the literature or by examination of biological data (1.1 million annotations covering 2226 species) and those generated computationally via automated methods. As manual annotations are often used to propagate functional predictions between related proteins within and between genomes, it is critical to provide accurate consistent manual annotations. Toward this goal, we present here the conventions defined by the GOC for the creation of manual annotation. This guide represents the best practices for manual annotation as established by the GOC project over the past 12 years. We hope this guide will encourage research communities to annotate gene products of their interest to enhance the corpus of GO annotations available to all. DATABASE URL: http://www.geneontology.org.

Description
Keywords
Biological Phenomena, Data Mining, Decision Trees, Molecular Sequence Annotation, Reference Standards, Sequence Homology, Nucleic Acid
Journal Title
Database (Oxford)
Conference Name
Journal ISSN
1758-0463
1758-0463
Volume Title
2013
Publisher
Oxford University Press (OUP)