Inducing Embeddings for Rare and Unseen Words by Leveraging Lexical Resources
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics
15th Conference of the European Chapter of the Association for Computational Linguistics
Association for Computational Linguistics
2, Short Papers
Pilehvar, M., & Collier, N. (2017). Inducing Embeddings for Rare and Unseen Words by Leveraging Lexical Resources. Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2, Short Papers, 388-393. https://doi.org/10.17863/CAM.9211
We put forward an approach that exploits the knowledge encoded in lexical resources to induce representations for words that were not encountered frequently during training. Our approach has an advantage over past work in that it enables vocabulary expansion not only for morphological variations but also for infrequent domain-specific terms. We performed evaluations in different settings, showing that the technique can provide consistent improvements on multiple benchmarks across domains.
The authors gratefully acknowledge the support of the MRC grant No. MR/M025160/1 for PheneBank.
This record's DOI: https://doi.org/10.17863/CAM.9211
This record's URL: https://www.repository.cam.ac.uk/handle/1810/263834
Attribution 4.0 International
Licence URL: http://creativecommons.org/licenses/by/4.0/