Repository logo
 

A guide to best practices for Gene Ontology (GO) manual annotation.

Published version
Peer-reviewed

Type

Article

Change log

Authors

Balakrishnan, Rama 
Harris, Midori A 
Huntley, Rachael 
Van Auken, Kimberly 
Cherry, J Michael 

Abstract

The Gene Ontology Consortium (GOC) is a community-based bioinformatics project that classifies gene product function through the use of structured controlled vocabularies. A fundamental application of the Gene Ontology (GO) is in the creation of gene product annotations, evidence-based associations between GO definitions and experimental or sequence-based analysis. Currently, the GOC disseminates 126 million annotations covering >374,000 species including all the kingdoms of life. This number includes two classes of GO annotations: those created manually by experienced biocurators reviewing the literature or by examination of biological data (1.1 million annotations covering 2226 species) and those generated computationally via automated methods. As manual annotations are often used to propagate functional predictions between related proteins within and between genomes, it is critical to provide accurate consistent manual annotations. Toward this goal, we present here the conventions defined by the GOC for the creation of manual annotation. This guide represents the best practices for manual annotation as established by the GOC project over the past 12 years. We hope this guide will encourage research communities to annotate gene products of their interest to enhance the corpus of GO annotations available to all. DATABASE URL: http://www.geneontology.org.

Description

Keywords

Biological Phenomena, Data Mining, Decision Trees, Molecular Sequence Annotation, Reference Standards, Sequence Homology, Nucleic Acid

Journal Title

Database (Oxford)

Conference Name

Journal ISSN

1758-0463
1758-0463

Volume Title

2013

Publisher

Oxford University Press (OUP)