Second-order contexts from lexical substitutes for few-shot learning of word representations

Liu, Q; McCarthy, D; Korhonen, A

Second-order contexts from lexical substitutes for few-shot learning of word representations

Published version

Peer-reviewed

Repository URI

https://www.repository.cam.ac.uk/handle/1810/297013

Repository DOI

https://doi.org/10.17863/CAM.44054

Files

Published version (246.99 KB)

Type

Conference Object

Authors

Liu, Q

McCarthy, D

Korhonen, A

Abstract

There is a growing awareness of the need to handle rare and unseen words in word representation modelling. In this paper, we focus on few-shot learning of emerging concepts that fully exploits only a few available contexts. We introduce a substitute-based context representation technique that can be applied on an existing word embedding space. Previous context-based approaches to modelling unseen words only consider bag-of-word firstorder contexts, whereas our method aggregates contexts as second-order substitutes that are produced by a sequence-aware sentence completion model. We experimented with three tasks that aim to test the modelling of emerging concepts. We found that these tasks show different emphasis on first and second order contexts, and our substitute-based method achieved superior performance on naturallyoccurring contexts from corpora.

Journal Title

*SEM@NAACL-HLT 2019 - 8th Joint Conference on Lexical and Computational Semantics

Conference Name

The Eighth Joint Conference on Lexical and Computational Semantics

Publisher DOI

https://doi.org/10.17863/CAM.44054

Rights

Attribution 4.0 International

Collections

Cambridge University Research Outputs