Domain Adaptive Inference for Neural Machine Translation

Accepted version
Peer-reviewed

Type

Conference Object

Authors

Gispert, Adria de 
Byrne, Bill 

Abstract

We investigate adaptive ensemble weighting for Neural Machine Translation, addressing the case of improving performance on a new and potentially unknown domain without sacrificing performance on the original domain. We adapt sequentially across two Spanish-English and three English-German tasks, comparing unregularized fine-tuning, L2 and Elastic Weight Consolidation. We then report a novel scheme for adaptive NMT ensemble decoding by extending Bayesian Interpolation with source information, and show strong improvements across test domains without access to the domain label.

Keywords

cs.CL

Journal Title

Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics

Rights

All rights reserved