Research data supporting "Large-Scale Multi-Domain Belief Tracking with Knowledge Sharing"
dc.contributor.author | Budzianowski, Pawel | |
dc.contributor.author | Ramadan, Osman | |
dc.contributor.author | Gasic, Milica | |
dc.date.accessioned | 2018-08-09T07:02:43Z | |
dc.date.available | 2018-08-09T07:02:43Z | |
dc.identifier.uri | https://www.repository.cam.ac.uk/handle/1810/278720 | |
dc.description | The Multi-Domain Wizard-of-Oz dataset (MultiWOZ) is a collection of human-human written conversations spanning multiple domains and topics. The dataset was collected using the Wizard-of-Oz setup on Amazon MTurk. Each dialogue contains a goal label and several exchanges between a visitor and the system. Each system turn is annotated with slot-value pairs that give a coarse representation of the dialogue state. There are 9855 dialogues in total. | |
dc.description.sponsorship | The data collection was funded through a Google Faculty Award. | |
dc.format | The dataset contains the following seven JSON files: 1. data.json: the WOZ dialogue dataset, which contains the conversations between users and wizards, as well as a set of coarse labels for each user turn. 2. restaurant_db.json: the Cambridge restaurant database file, containing restaurants in the Cambridge UK area and a set of attributes. 3. attraction_db.json: the Cambridge attraction database file, containing attractions in the Cambridge UK area and a set of attributes. 4. hotel_db.json: the Cambridge hotel database file, containing hotels in the Cambridge UK area and a set of attributes. 5. train_db.json: the Cambridge train (with artificial connections) database file, containing trains in the Cambridge UK area and a set of attributes. 6. valListFile.json: list of dialogues for validation. 7. testListFile.json: list of dialogues for testing. | |
dc.rights | Attribution 4.0 International (CC BY 4.0) | |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | |
dc.subject | dialogue system | |
dc.subject | belief tracking | |
dc.subject | wizard of oz | |
dc.title | Research data supporting "Large-Scale Multi-Domain Belief Tracking with Knowledge Sharing" | |
dc.type | Dataset | |
dc.identifier.doi | 10.17863/CAM.26059 | |
rioxxterms.licenseref.uri | https://creativecommons.org/licenses/by/4.0/ | |
datacite.contributor.supervisor | Gasic, Milica | |
dcterms.format | JSON | |
dc.contributor.orcid | Budzianowski, Pawel [0000-0003-0013-7931] | |
dc.contributor.orcid | Gasic, Milica [0000-0003-0318-9147] | |
rioxxterms.type | Other | |
datacite.issupplementto.url | http://aclweb.org/anthology/P18-2069 |
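
The file list in dc.format suggests a simple way to split the dialogues for experiments. The following is a minimal sketch, not the authors' loader, and it rests on three assumptions that this record does not confirm: that data.json is a JSON object keyed by dialogue identifier, that valListFile.json and testListFile.json hold one dialogue identifier per line, and that dialogues named in neither list form the training set. The function names are hypothetical.

```python
import json
from pathlib import Path


def read_id_list(path):
    """Read dialogue identifiers, assuming one identifier per line."""
    text = Path(path).read_text(encoding="utf-8")
    return {line.strip() for line in text.splitlines() if line.strip()}


def split_dialogues(dialogues, val_ids, test_ids):
    """Partition a dict of dialogues into train/val/test by identifier.

    Assumption: anything not listed for validation or testing is training data.
    """
    splits = {"train": {}, "val": {}, "test": {}}
    for dialogue_id, dialogue in dialogues.items():
        if dialogue_id in test_ids:
            splits["test"][dialogue_id] = dialogue
        elif dialogue_id in val_ids:
            splits["val"][dialogue_id] = dialogue
        else:
            splits["train"][dialogue_id] = dialogue
    return splits


def load_splits(data_dir):
    """Load data.json and split it using the two list files (assumed layout)."""
    data_dir = Path(data_dir)
    with open(data_dir / "data.json", encoding="utf-8") as handle:
        dialogues = json.load(handle)  # assumed: {dialogue_id: dialogue}
    val_ids = read_id_list(data_dir / "valListFile.json")
    test_ids = read_id_list(data_dir / "testListFile.json")
    return split_dialogues(dialogues, val_ids, test_ids)
```

If the assumptions hold, `load_splits("path/to/dataset")` returns a dictionary with `train`, `val`, and `test` entries. Inspect the files first, since their actual layout may differ from what is assumed here.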