Show simple item record

dc.contributor.authorStahlberg, Felixen
dc.contributor.authorde Gispert, Aen
dc.contributor.authorHasler, Evaen
dc.contributor.authorByrne, Williamen
dc.date.accessioned2017-04-27T11:10:28Z
dc.date.available2017-04-27T11:10:28Z
dc.date.issued2017-04-07en
dc.identifier.urihttps://www.repository.cam.ac.uk/handle/1810/263838
dc.description.abstractWe present a novel scheme to combine neural machine translation (NMT) with traditional statistical machine translation (SMT). Our approach borrows ideas from linearised lattice minimum Bayes-risk decoding for SMT. The NMT score is combined with the Bayes-risk of the translation according the SMT lattice. This makes our approach much more flexible than n-best list or lattice rescoring as the neural decoder is not restricted to the SMT search space. We show an efficient and simple way to integrate risk estimation into the NMT decoder which is suitable for word-level as well as subword-unit-level NMT. We test our method on English-German and Japanese-English and report significant gains over lattice rescoring on several data sets for both single and ensembled NMT. The MBR decoder produces entirely new hypotheses far beyond simply rescoring the SMT search space or fixing UNKs in the NMT output.
dc.description.sponsorshipThis work was supported by the U.K. Engineering and Physical Sciences Research Council (EPSRC grant EP/L027623/1).
dc.language.isoenen
dc.publisherAssociation for Computational Linguistics
dc.rightsAttribution 4.0 Internationalen
dc.rightsAttribution 4.0 Internationalen
dc.rightsAttribution 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/en
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/en
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/en
dc.titleNeural Machine Translation by Minimising the Bayes-risk with Respect to Syntactic Translation Latticesen
dc.typeConference Object
prism.endingPage368
prism.publicationDate2017en
prism.publicationNameProceedings of the 15th Conference of the European Chapter of the Association for Computational Linguisticsen
prism.startingPage362
prism.volume2, Short Papersen
dc.identifier.doi10.17863/CAM.9215
dcterms.dateAccepted2017-02-01en
rioxxterms.versionVoRen
rioxxterms.licenseref.urihttp://creativecommons.org/licenses/by/4.0/en
rioxxterms.licenseref.startdate2017-04-07en
dc.contributor.orcidStahlberg, Felix [0000-0002-0430-5704]
rioxxterms.typeConference Paper/Proceeding/Abstracten
pubs.conference-name15th Conference of the European Chapter of the Association for Computational Linguisticsen
pubs.conference-start-date2017-04-03en
cam.orpheus.successThu Nov 05 11:57:28 GMT 2020 - The item has an open VoR version.*
rioxxterms.freetoread.startdate2100-01-01


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record

Attribution 4.0 International
Except where otherwise noted, this item's licence is described as Attribution 4.0 International