Initializing neural networks for hierarchical multi-label text classification

Baker, S; Korhonen, A

doi:10.17863/CAM.12418

Accepted version

Peer-reviewed

Repository URI

https://www.repository.cam.ac.uk/handle/1810/285913

Repository DOI

https://doi.org/10.17863/CAM.12418

Files

Accepted version (484.84 KB)

Type

Conference Object

Authors

Baker, Simon

https://orcid.org/0000-0002-0998-438X

Korhonen, A

Abstract

Many tasks in the biomedical domain require the assignment of one or more predefined labels to input text, where the labels are part of a hierarchical structure (such as a taxonomy). The conventional approach is to use a one-vs.-rest (OVR) classification setup, where a binary classifier is trained for each label in the taxonomy or ontology, and all instances not belonging to the class are considered negative examples. The main drawbacks of this approach are that dependencies between classes are not leveraged in the training and classification process, and that training parallel classifiers incurs additional computational cost. In this paper, we apply a new method for hierarchical multi-label text classification that initializes the final hidden layer of a neural network model such that it leverages label co-occurrence relations such as hypernymy. This approach elegantly lends itself to hierarchical classification. We evaluated this approach using two hierarchical multi-label text classification tasks in the biomedical domain using both sentence- and document-level classification. Our evaluation shows promising results for this approach.
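
The initialization idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives no implementation details, so the sketch assumes that one hidden unit is dedicated to each observed label co-occurrence pattern, that hypernymy is encoded by adding every ancestor of a label to its label set, and that the dedicated weights are set to the upper bound of a uniform initialization range. The names `add_ancestors`, `init_output_weights`, and the `parent` map are hypothetical.

```python
import numpy as np

def add_ancestors(labels, parent):
    """Close a label set under a (hypothetical) child -> parent map.

    Assumes the hierarchy is a tree or forest, so the walk terminates.
    """
    closed = set(labels)
    for label in labels:
        while label in parent:
            label = parent[label]
            closed.add(label)
    return closed

def init_output_weights(label_sets, n_labels, n_hidden, parent, seed=0):
    """Build a (n_hidden, n_labels) weight matrix for the output layer.

    Assumption: each co-occurrence pattern with at least two labels gets
    one dedicated hidden unit; remaining units keep random weights.
    """
    rng = np.random.default_rng(seed)
    # Assumed scale: the usual uniform-initialization bound.
    bound = np.sqrt(6.0 / (n_hidden + n_labels))
    weights = rng.uniform(-bound, bound, size=(n_hidden, n_labels))

    # Unique patterns after adding ancestors, in a deterministic order.
    patterns = sorted({tuple(sorted(add_ancestors(s, parent))) for s in label_sets})
    patterns = [p for p in patterns if len(p) > 1]

    # One row per pattern: `bound` for labels in the pattern, 0 elsewhere.
    for row, pattern in enumerate(patterns[:n_hidden]):
        weights[row, :] = 0.0
        weights[row, list(pattern)] = bound
    return weights, patterns

# Toy hierarchy: labels 1 and 2 are children of label 0; label 3 is a root.
parent = {1: 0, 2: 0}
label_sets = [{1}, {2}, {1, 3}, {3}]
weights, patterns = init_output_weights(
    label_sets, n_labels=4, n_hidden=5, parent=parent
)
print(patterns)
print(weights.round(3))
```

In this toy setup, a training instance labeled only with label 1 also contributes its parent, label 0, to the pattern, which is how the sketch lets the hierarchy shape the initial weights. Hidden units beyond the number of patterns keep their random values.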

Journal Title

BioNLP 2017 - SIGBioMed Workshop on Biomedical Natural Language Processing, Proceedings of the 16th BioNLP Workshop

Conference Name

BioNLP2017

Publisher

Association for Computational Linguistics

Publisher DOI

https://doi.org/10.17863/CAM.12418

Rights

Attribution 4.0 International

Sponsorship

Medical Research Council (G0601766)

Collections

University of Cambridge Research Outputs (Articles and Conferences)
