Accelerating NMT Batched Beam Decoding with LMBR Posteriors for Deployment

Iglesias, Gonzalo; Tambellini, William; de Gispert, Adrià; Hasler, Eva; Byrne, WJ

Accelerating NMT Batched Beam Decoding with LMBR Posteriors for Deployment

Accepted version

Peer-reviewed

Repository URI

https://www.repository.cam.ac.uk/handle/1810/287962

Repository DOI

https://doi.org/10.17863/CAM.35282

Files

Accepted version (402.75 KB)

Type

Conference Object

Authors

Iglesias, Gonzalo

Tambellini, William

de Gispert, Adrià

Hasler, Eva

Byrne, WJ

Abstract

We describe a batched beam decoding algorithm for NMT with LMBR n-gram posteriors, showing that LMBR techniques still yield gains on top of the best recently reported results with Transformers. We also discuss acceleration strategies for deployment, and the effect of the beam size and batching on memory and speed.

Journal Title

http://aclweb.org/anthology/N18-1000

Conference Name

Proceedings of the North Americal Association of Computational Linguistics and Human Language Technologies Conference (NAACL-HLT ) 2018

Publisher DOI

https://doi.org/10.17863/CAM.35282

Rights

http://www.rioxx.net/licenses/all-rights-reserved

Collections

Cambridge University Research Outputs