Bridging languages through images with deep partial canonical correlation analysis

Authors
Rotman, G 
Vulić, I 
Reichart, R 

Type
Conference Object
Change log
Abstract

We present a deep neural network that leverages images to improve bilingual text embeddings. Relying on bilingual image tags and descriptions, our approach conditions text embedding induction on the shared visual information for both languages, producing highly correlated bilingual embeddings. In particular, we propose a novel model based on Partial Canonical Correlation Analysis (PCCA). While the original PCCA finds linear projections of two views in order to maximize their canonical correlation conditioned on a shared third variable, we introduce a non-linear Deep PCCA (DPCCA) model, and develop a new stochastic iterative algorithm for its optimization. We evaluate PCCA and DPCCA on multilingual word similarity and cross-lingual image description retrieval. Our models outperform a large variety of previous methods, despite not having access to any visual signal at test-time inference. Our code and data are available at: https://github.com/rotmanguy/DPCCA
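For readers unfamiliar with the linear baseline, the sketch below illustrates classical PCCA as described in the abstract: regress the shared third view Z (here, the visual signal) out of the two text views X and Y, then run standard CCA on the residuals. This is a minimal NumPy illustration under those assumptions, not the paper's DPCCA model or its stochastic optimization algorithm; all function names and the synthetic data are hypothetical.

```python
# Minimal sketch of linear Partial CCA (PCCA), illustrative only:
# 1) remove the part of each view that is linearly predictable from the shared view Z,
# 2) run ordinary CCA on the residuals.
import numpy as np

def residualize(A, Z):
    """Return A minus its least-squares linear prediction from Z (with intercept)."""
    Z1 = np.hstack([Z, np.ones((Z.shape[0], 1))])
    coef, *_ = np.linalg.lstsq(Z1, A, rcond=None)
    return A - Z1 @ coef

def inv_sqrt(S):
    """Inverse matrix square root of a symmetric positive-definite matrix."""
    w, V = np.linalg.eigh(S)
    return V @ np.diag(1.0 / np.sqrt(w)) @ V.T

def pcca(X, Y, Z, k=2, reg=1e-6):
    """Top-k canonical correlations of X and Y, conditioned on Z."""
    Xr, Yr = residualize(X, Z), residualize(Y, Z)
    Xr -= Xr.mean(0)
    Yr -= Yr.mean(0)
    n = X.shape[0]
    Sxx = Xr.T @ Xr / n + reg * np.eye(Xr.shape[1])
    Syy = Yr.T @ Yr / n + reg * np.eye(Yr.shape[1])
    Sxy = Xr.T @ Yr / n
    T = inv_sqrt(Sxx) @ Sxy @ inv_sqrt(Syy)
    U, s, Vt = np.linalg.svd(T)
    A = inv_sqrt(Sxx) @ U[:, :k]   # projection for view X
    B = inv_sqrt(Syy) @ Vt[:k].T   # projection for view Y
    return A, B, s[:k]             # s[:k] = partial canonical correlations

# Toy example: two views that share structure both through Z and beyond it.
rng = np.random.default_rng(0)
Z = rng.normal(size=(500, 5))            # shared third view (e.g., visual features)
W = rng.normal(size=(500, 4))            # extra shared structure not explained by Z
X = Z @ rng.normal(size=(5, 20)) + W @ rng.normal(size=(4, 20)) + 0.1 * rng.normal(size=(500, 20))
Y = Z @ rng.normal(size=(5, 30)) + W @ rng.normal(size=(4, 30)) + 0.1 * rng.normal(size=(500, 30))
A, B, corrs = pcca(X, Y, Z, k=3)
print(corrs)  # high correlations for the directions driven by W, after partialling out Z
```

DPCCA, as summarized in the abstract, replaces the linear projections above with deep non-linear networks and trains them with a stochastic iterative procedure; see the linked repository for the authors' implementation.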

Publication Date
2018
Online Publication Date
Acceptance Date
2018-04-21
Keywords
Journal Title
ACL 2018 - 56th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers)
Journal ISSN
Volume Title
1
Publisher
Association for Computational Linguistics
Sponsorship
European Research Council (648909)