Dialogue response selection with hierarchical curriculum learning
dc.contributor.author | Su, Y | |
dc.contributor.author | Cai, D | |
dc.contributor.author | Zhou, Q | |
dc.contributor.author | Lin, Z | |
dc.contributor.author | Baker, S | |
dc.contributor.author | Cao, Y | |
dc.contributor.author | Shi, S | |
dc.contributor.author | Collier, N | |
dc.contributor.author | Wang, Y | |
dc.date.accessioned | 2022-05-27T23:30:35Z | |
dc.date.available | 2022-05-27T23:30:35Z | |
dc.date.issued | 2021 | |
dc.identifier.isbn | 9781954085527 | |
dc.identifier.uri | https://www.repository.cam.ac.uk/handle/1810/337573 | |
dc.description.abstract | We study the learning of a matching model for dialogue response selection. Motivated by the recent finding that models trained with random negative samples are not ideal in real-world scenarios, we propose a hierarchical curriculum learning framework that trains the matching model in an “easy-to-difficult” scheme. Our learning framework consists of two complementary curricula: (1) corpus-level curriculum (CC); and (2) instance-level curriculum (IC). In CC, the model gradually increases its ability in finding the matching clues between the dialogue context and a response candidate. As for IC, it progressively strengthens the model's ability in identifying the mismatching information between the dialogue context and a response candidate. Empirical studies on three benchmark datasets with three state-of-the-art matching models demonstrate that the proposed learning framework significantly improves the model performance across various evaluation metrics. | |
dc.publisher | Association for Computational Linguistics | |
dc.rights | Attribution 4.0 International | |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | |
dc.title | Dialogue response selection with hierarchical curriculum learning | |
dc.type | Conference Object | |
dc.publisher.department | Department of Theoretical & Applied Linguistics | |
dc.publisher.department | Faculty of Modern And Medieval Languages And Linguistics | |
dc.date.updated | 2022-05-27T06:34:45Z | |
prism.endingPage | 1751 | |
prism.publicationDate | 2021 | |
prism.publicationName | ACL-IJCNLP 2021 - 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Proceedings of the Conference | |
prism.startingPage | 1740 | |
dc.identifier.doi | 10.17863/CAM.84982 | |
dcterms.dateAccepted | 2021-05-05 | |
rioxxterms.versionofrecord | 10.18653/v1/2021.acl-long.137 | |
rioxxterms.version | VoR | |
dc.contributor.orcid | Su, Yixuan [0000-0002-1472-7791] | |
dc.contributor.orcid | Collier, Nigel [0000-0002-7230-4164] | |
pubs.conference-name | Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers) | |
pubs.conference-start-date | 2021-08 | |
cam.orpheus.success | 2022-06-22 | |
cam.orpheus.counter | 3 | |
cam.depositDate | 2022-05-27 | |
pubs.conference-finish-date | 2021-08 | |
pubs.licence-identifier | apollo-deposit-licence-2-1 | |
pubs.licence-display-name | Apollo Repository Deposit Licence Agreement |
Files in this item
This item appears in the following Collection(s)
-
Cambridge University Research Outputs
Research outputs of the University of Cambridge