Show simple item record

dc.contributor.authorPonti, Edoardo
dc.contributor.authorReichart, Roi
dc.contributor.authorKorhonen, Anna
dc.contributor.authorVulic, I
dc.date.accessioned2019-02-14T12:42:07Z
dc.date.available2019-02-14T12:42:07Z
dc.date.issued2018-07-10
dc.identifier.isbn9781948087322
dc.identifier.urihttps://www.repository.cam.ac.uk/handle/1810/289394
dc.description.abstractThe transfer or share of knowledge between languages is a popular solution to resource scarcity in NLP. However, the effectiveness of cross-lingual transfer can be challenged by variation in syntactic structures. Frameworks such as Universal Dependencies (UD) are designed to be cross-lingually consistent, but even in carefully designed resources trees representing equivalent sentences may not always overlap. In this paper, we measure cross-lingual syntactic variation, or anisomorphism, in the UD treebank collection, considering both morphological and structural properties. We show that reducing the level of anisomorphism yields consistent gains in cross-lingual transfer tasks. We introduce a source language selection procedure that facilitates effective cross-lingual parser transfer, and propose a typologically driven method for syntactic tree processing which reduces anisomorphism. Our results show the effectiveness of this method for both machine translation and cross-lingual sentence similarity, demonstrating the importance of syntactic structure compatibility for boosting cross-lingual transfer in NLP.
dc.publisherAssociation for Computational Linguistics
dc.rightsAttribution 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.titleIsomorphic Transfer of Syntactic Structures in Cross-Lingual NLP
dc.typeConference Object
prism.endingPage1542
prism.publicationDate2018
prism.publicationNameProceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL 2018)
prism.startingPage1531
dc.identifier.doi10.18653/v1/P18-1142
dcterms.dateAccepted2018-04-21
rioxxterms.versionofrecord10.18653/v1/P18-1142
rioxxterms.versionVoR
rioxxterms.licenseref.urihttp://www.rioxx.net/licenses/all-rights-reserved
rioxxterms.licenseref.startdate2018-07-10
dc.contributor.orcidPonti, Edoardo [0000-0002-6308-1050]
rioxxterms.typeConference Paper/Proceeding/Abstract
pubs.funder-project-idEuropean Research Council (648909)
pubs.conference-name56th Annual Meeting of the Association for Computational Linguistics (ACL 2018)
pubs.conference-start-date2018-07-15
cam.orpheus.successThu Nov 05 11:53:28 GMT 2020 - The item has an open VoR version.
pubs.conference-finish-date2018-07-20
rioxxterms.freetoread.startdate2100-01-01


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record

Attribution 4.0 International
Except where otherwise noted, this item's licence is described as Attribution 4.0 International