Adversarial propagation and zero-shot cross-lingual transfer of word vector specialization
View / Open Files
Authors
Ponti, EM
Vulić, I
Glavaš, G
Mrkšić, N
Korhonen, A
Publication Date
2018Journal Title
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, EMNLP 2018
Conference Name
Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2018)
ISBN
9781948087841
Pages
282-293
Type
Conference Object
This Version
AM
Metadata
Show full item recordCitation
Ponti, E., Vulić, I., Glavaš, G., Mrkšić, N., & Korhonen, A. (2018). Adversarial propagation and zero-shot cross-lingual transfer of word vector specialization. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, EMNLP 2018, 282-293. https://doi.org/10.17863/CAM.35175
Abstract
Semantic \specialization is a process of fine-tuning pre-trained distributional word vectors using external lexical knowledge (e.g., WordNet) to accentuate a particular semantic relation in the specialized vector space. While post-processing specialization methods are applicable to arbitrary distributional vectors, they are limited to updating only the vectors of words occurring in external lexicons (i.e., seen words), leaving the vectors of all other words unchanged. We propose a novel approach to specializing the full distributional vocabulary. Our adversarial post-specialization method propagates the external lexical knowledge to the full distributional space. We exploit words seen in the resources as training examples for learning a global specialization function. This function is learned by combining a standard L2-distance loss with a adversarial loss: the adversarial component produces more realistic output vectors. We show the effectiveness and robustness of the proposed method across three languages and on three tasks: word similarity, dialog state tracking, and lexical simplification. We report consistent improvements over distributional word vectors and vectors specialized by other state-of-the-art specialization frameworks. Finally, we also propose a cross-lingual transfer method for zero-shot specialization which successfully specializes a full target distributional space without any lexical knowledge in the target language and without any bilingual data.
Sponsorship
European Research Council (648909)
Identifiers
External DOI: https://doi.org/10.17863/CAM.35175
This record's URL: https://www.repository.cam.ac.uk/handle/1810/287860
Rights
Licence:
http://www.rioxx.net/licenses/all-rights-reserved
Statistics
Total file downloads (since January 2020). For more information on metrics see the
IRUS guide.
Recommended or similar items
The current recommendation prototype on the Apollo Repository will be turned off on 03 February 2023. Although the pilot has been fruitful for both parties, the service provider IKVA is focusing on horizon scanning products and so the recommender service can no longer be supported. We recognise the importance of recommender services in supporting research discovery and are evaluating offerings from other service providers. If you would like to offer feedback on this decision please contact us on: support@repository.cam.ac.uk