
Ontology design patterns to disambiguate relations between genes and gene products in GENIA.


Authors

Hoehndorf, Robert 
Ngonga Ngomo, Axel-Cyrille 
Pyysalo, Sampo 
Ohta, Tomoko 
Oellrich, Anika 

Abstract

MOTIVATION: Annotated reference corpora play an important role in biomedical information extraction. Semantic annotation of the natural language texts in these reference corpora using formal ontologies is challenging due to the inherent ambiguity of natural language. Providing formal definitions and axioms for semantic annotations offers a means of ensuring consistency and enables the development of verifiable annotation guidelines. Consistent semantic annotations facilitate the automatic discovery of new information through deductive inferences.

RESULTS: We provide a formal characterization of the relations used in the recent GENIA corpus annotations. For this purpose, we both select existing axiom systems based on the desired properties of the relations within the domain and develop new axioms for several relations. To apply this ontology of relations to the semantic annotation of text corpora, we implement two ontology design patterns. In addition, we provide a software application that converts annotated GENIA abstracts into OWL ontologies by combining the ontology of relations with the design patterns. As a result, the GENIA abstracts become available as OWL ontologies and are amenable to automated verification, deductive inferences and other knowledge-based applications.

AVAILABILITY: Documentation, implementation and examples are available from http://www-tsujii.is.s.u-tokyo.ac.jp/GENIA/.

Keywords

0801 Artificial Intelligence and Image Processing

Journal Title

J Biomed Semantics

Journal ISSN

2041-1480

Publisher

Springer Science and Business Media LLC