Keep the Primary, Rewrite the Secondary: A Two-Stage Approach for Paraphrase Generation
dc.contributor.author | Su, Yixuan | |
dc.contributor.author | Vandyke, D | |
dc.contributor.author | Baker, S | |
dc.contributor.author | Wang, Y | |
dc.contributor.author | Collier, Nigel | |
dc.date.accessioned | 2022-05-27T23:30:31Z | |
dc.date.available | 2022-05-27T23:30:31Z | |
dc.date.issued | 2021-01-01 | |
dc.identifier.isbn | 9781954085541 | |
dc.identifier.uri | https://www.repository.cam.ac.uk/handle/1810/337572 | |
dc.description.abstract | Paraphrase generation is an important and challenging NLG problem. In this work, we propose a new Identification-then-Aggregation (IA) framework to tackle this task. In the identification step, the input tokens are sorted into two groups by a novel Primary/Secondary Identification (PSI) algorithm. In the aggregation step, these groups are separately encoded before being aggregated by a custom-designed decoder, which autoregressively generates the paraphrased sentence. In extensive experiments on two benchmark datasets, we demonstrate that our model outperforms previous studies by a notable margin. We also show that the proposed approach can generate paraphrases in an interpretable and controllable way. | |
dc.rights | Publisher's own licence | |
dc.title | Keep the Primary, Rewrite the Secondary: A Two-Stage Approach for Paraphrase Generation | |
dc.type | Conference Object | |
dc.publisher.department | Department of Theoretical & Applied Linguistics | |
dc.publisher.department | Faculty of Modern And Medieval Languages And Linguistics | |
dc.date.updated | 2022-05-27T06:31:56Z | |
prism.endingPage | 569 | |
prism.publicationDate | 2021 | |
prism.publicationName | Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 | |
prism.startingPage | 560 | |
dc.identifier.doi | 10.17863/CAM.84981 | |
rioxxterms.versionofrecord | 10.17863/CAM.84981 | |
rioxxterms.version | VoR | |
dc.contributor.orcid | Su, Yixuan [0000-0002-1472-7791] | |
dc.contributor.orcid | Collier, Nigel [0000-0002-7230-4164] | |
cam.orpheus.counter | 8 | * |
cam.depositDate | 2022-05-27 | |
pubs.licence-identifier | apollo-deposit-licence-2-1 | |
pubs.licence-display-name | Apollo Repository Deposit Licence Agreement | |
rioxxterms.freetoread.startdate | 2100-01-01 |
This item appears in the following Collection(s)
Cambridge University Research Outputs
Research outputs of the University of Cambridge