Inducing Embeddings for Rare and Unseen Words by Leveraging Lexical Resources
Published version
Peer-reviewed
Repository URI
Repository DOI
Type
Conference Object
Change log
Authors
Pilehvar, MT
Collier, Nigel https://orcid.org/0000-0002-7230-4164
Abstract
We put forward an approach that exploits the knowledge encoded in lexical resources in order to induce representations for words that were not encountered frequently during training. Our approach provides an advantage over the past work in that it enables vocabulary expansion not only for morphological variations, but also for infrequent domain specific terms. We performed evaluations in different settings, showing that the technique can provide consistent improvements on multiple benchmarks across domains.
Description
Keywords
Journal Title
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics
Conference Name
15th Conference of the European Chapter of the Association for Computational Linguistics
Journal ISSN
Volume Title
2, Short Papers
Publisher
Association for Computational Linguistics
Publisher DOI
Sponsorship
Medical Research Council (MR/M025160/1)
The authors gratefully acknowledge the support of the MRC grant No. MR/M025160/1 for PheneBank.