Accelerating NMT Batched Beam Decoding with LMBR Posteriors for Deployment
View / Open Files
Authors
Iglesias, Gonzalo
Tambellini, William
de Gispert, Adrià
Hasler, Eva
Byrne, WJ
Journal Title
http://aclweb.org/anthology/N18-1000
Conference Name
Proceedings of the North Americal Association of Computational Linguistics and Human Language Technologies Conference (NAACL-HLT ) 2018
Type
Conference Object
This Version
AM
Metadata
Show full item recordCitation
Iglesias, G., Tambellini, W., de Gispert, A., Hasler, E., & Byrne, W. Accelerating NMT Batched Beam Decoding with LMBR Posteriors for Deployment. http://aclweb.org/anthology/N18-1000 https://doi.org/10.17863/CAM.35282
Abstract
We describe a batched beam decoding algorithm for NMT with LMBR n-gram posteriors, showing that LMBR techniques still yield gains on top of the best recently reported results with Transformers. We also discuss acceleration strategies for deployment, and the effect of the beam size and batching on memory and speed.
Identifiers
External DOI: https://doi.org/10.17863/CAM.35282
This record's URL: https://www.repository.cam.ac.uk/handle/1810/287962
Rights
Licence:
http://www.rioxx.net/licenses/all-rights-reserved
Statistics
Total file downloads (since January 2020). For more information on metrics see the
IRUS guide.
Recommended or similar items
The current recommendation prototype on the Apollo Repository will be turned off on 03 February 2023. Although the pilot has been fruitful for both parties, the service provider IKVA is focusing on horizon scanning products and so the recommender service can no longer be supported. We recognise the importance of recommender services in supporting research discovery and are evaluating offerings from other service providers. If you would like to offer feedback on this decision please contact us on: support@repository.cam.ac.uk