Show simple item record

dc.contributor.authorBudzianowski, Pawelen
dc.contributor.authorWen, T-Hen
dc.contributor.authorGasic, Milicaen
dc.date.accessioned2018-09-21T14:46:01Z
dc.date.available2018-09-21T14:46:01Z
dc.identifier.urihttps://www.repository.cam.ac.uk/handle/1810/280608
dc.descriptionDataset contains the following json files: 1. data.json: the woz dialogue dataset, which contains the conversation users and wizards, as well as a set of coarse labels for each user turn. 2. restaurant_db.json: the Cambridge restaurant database file, containing restaurants in the Cambridge UK area and a set of attributes. 3. attraction_db.json: the Cambridge attraction database file, contining attractions in the Cambridge UK area and a set of attributes. 4. hotel_db.json: the Cambridge hotel database file, containing hotels in the Cambridge UK area and a set of attributes. 5. train_db.json: the Cambridge train (with artificial connections) database file, containing trains in the Cambridge UK area and a set of attributes. 6. hospital_db.json: the Cambridge hospital database file, contatining information about departments. 7. police_db.json: the Cambridge police station information. 8. taxi_db.json: slot-value list for taxi domain. 9. valListFile.json: list of dialogues for validation. 10. testListFile.json: list of dialogues for testing. 11. system_acts.json: system acts annotations 12. ontology.json: Data-based ontology.en
dc.description.sponsorshipThe data collection was funded through Google Faculty Award.en
dc.formatThe Multi-Domain Wizard-of-Oz dataset (MultiWOZ), a collection of human-human written conversations spanning over multiple domains and topics. The dataset was collected based on the Wizard of Oz experiment on Amazon MTurk. Each dialogue contains a goal label and several exchanges between a visitor and the system. Each system turn has labels from the set of slot-value pairs representing a coarse representation of dialogue state for both user and system. There are in total 10438 dialogues.en
dc.rightsAttribution 4.0 Internationalen
dc.rightsAttribution 4.0 Internationalen
dc.rightsAttribution 4.0 Internationalen
dc.rightsAttribution 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/en
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/en
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/en
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/en
dc.subjectdialogue systemen
dc.subjectdataseten
dc.subjectwizard of ozen
dc.titleResearch data supporting "MultiWOZ - A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue Modelling"en
dc.typeDataset
dc.identifier.doi10.17863/CAM.27632
datacite.ispreviousversionof.doi10.17863/CAM.41572
rioxxterms.licenseref.urihttp://creativecommons.org/licenses/by/4.0/en
datacite.contributor.supervisorGasic, Milica
dcterms.formatJSONen
dc.contributor.orcidBudzianowski, Pawel [0000-0003-0013-7931]
dc.contributor.orcidGasic, Milica [0000-0003-0318-9147]
rioxxterms.typeOtheren


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

Attribution 4.0 International
Except where otherwise noted, this item's licence is described as Attribution 4.0 International