Repository logo
 

Inducing Embeddings for Rare and Unseen Words by Leveraging Lexical Resources

Published version
Peer-reviewed

Change log

Authors

Pilehvar, MT 

Abstract

We put forward an approach that exploits the knowledge encoded in lexical resources in order to induce representations for words that were not encountered frequently during training. Our approach provides an advantage over the past work in that it enables vocabulary expansion not only for morphological variations, but also for infrequent domain specific terms. We performed evaluations in different settings, showing that the technique can provide consistent improvements on multiple benchmarks across domains.

Description

Keywords

Journal Title

Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics

Conference Name

15th Conference of the European Chapter of the Association for Computational Linguistics

Journal ISSN

Volume Title

2, Short Papers

Publisher

Association for Computational Linguistics
Sponsorship
Medical Research Council (MR/M025160/1)
The authors gratefully acknowledge the support of the MRC grant No. MR/M025160/1 for PheneBank.