Explicit retrofitting of distributional word vectors
dc.contributor.author | Glavaš, G | |
dc.contributor.author | Vulić, I | |
dc.date.accessioned | 2018-11-14T00:31:02Z | |
dc.date.available | 2018-11-14T00:31:02Z | |
dc.date.issued | 2018 | |
dc.identifier.isbn | 9781948087322 | |
dc.identifier.uri | https://www.repository.cam.ac.uk/handle/1810/285039 | |
dc.description.abstract | Semantic specialization of distributional word vectors, referred to as retrofitting, is a process of fine-tuning word vectors using external lexical knowledge in order to better embed some semantic relation. Existing retrofitting models integrate linguistic constraints directly into learning objectives and, consequently, specialize only the vectors of words from the constraints. In this work, in contrast, we transform external lexico-semantic relations into training examples which we use to learn an explicit retrofitting model (ER). The ER model allows us to learn a global specialization function and specialize the vectors of words unobserved in the training data as well. We report large gains over original distributional vector spaces in (1) intrinsic word similarity evaluation and on (2) two downstream tasks -- lexical simplification and dialog state tracking. Finally, we also successfully specialize vector spaces of new languages (i.e., unseen in the training data) by coupling ER with shared multilingual distributional vector spaces. | |
dc.publisher | Association for Computational Linguistics | |
dc.title | Explicit retrofitting of distributional word vectors | |
dc.type | Conference Object | |
prism.endingPage | 45 | |
prism.publicationDate | 2018 | |
prism.publicationName | ACL 2018 - 56th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers) | |
prism.startingPage | 34 | |
prism.volume | 1 | |
dc.identifier.doi | 10.17863/CAM.32409 | |
dcterms.dateAccepted | 2018-04-21 | |
rioxxterms.versionofrecord | 10.18653/v1/p18-1004 | |
rioxxterms.licenseref.uri | http://www.rioxx.net/licenses/all-rights-reserved | |
rioxxterms.licenseref.startdate | 2018-01-01 | |
rioxxterms.type | Conference Paper/Proceeding/Abstract | |
pubs.funder-project-id | European Research Council (648909) | |
pubs.conference-name | Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) | |
pubs.conference-start-date | 2018-07 | |
pubs.conference-finish-date | 2018-07 | |
rioxxterms.freetoread.startdate | 2019-07-10 |
Files in this item
This item appears in the following Collection(s)
-
Cambridge University Research Outputs
Research outputs of the University of Cambridge