Show simple item record

dc.contributor.authorBudzianowski, Pawel
dc.contributor.authorRamadan, Osman
dc.contributor.authorGasic, Milica
dc.date.accessioned2018-08-09T07:02:43Z
dc.date.available2018-08-09T07:02:43Z
dc.identifier.urihttps://www.repository.cam.ac.uk/handle/1810/278720
dc.descriptionThe Multi-Domain Wizard-of-Oz dataset (MultiWOZ), a collection of human-human written conversations spanning over multiple domains and topics. The dataset was collected based on the Wizard of Oz experiment on Amazon MTurk. Each dialogue contains a goal label and several exchanges between a visitor and the system. Each system turn has labels from the set of slot-value pairs representing a coarse representation of dialogue state. There are in total 9855 dialogues.
dc.description.sponsorshipThe data collection was funded through Google Faculty Award.
dc.formatDataset contains the following seven json files: 1. data.json: the woz dialogue dataset, which contains the conversion from users and wizards, as well as a set of coarse labels for each user turn. 2. restaurant_db.json: the Cambridge restaurant database file, containing restaurants in the Cambridge UK area and a set of attributes. 3. attraction_db.json: the Cambridge attraction database file, contining attractions in the Cambridge UK area and a set of attributes. 4. hotel_db.json: the Cambridge hotel database file, containing hotels in the Cambridge UK area and a set of attributes. 5. train_db.json: the Cambridge train (with artificial connections) database file, containing trains in the Cambridge UK area and a set of attributes. 6. valListFile.json: list of dialogues for validation. 7. testListFile.json: list of dialogues for testing.
dc.rightsAttribution 4.0 International (CC BY 4.0)
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subjectdialogue system
dc.subjectbelief tracking
dc.subjectwizard of oz
dc.titleResearch data supporting "Large-Scale Multi-Domain Belief Tracking with Knowledge Sharing"
dc.typeDataset
dc.identifier.doi10.17863/CAM.26059
rioxxterms.licenseref.urihttps://creativecommons.org/licenses/by/4.0/
datacite.contributor.supervisorGasic, Milica
dcterms.formatJSON
dc.contributor.orcidBudzianowski, Pawel [0000-0003-0013-7931]
dc.contributor.orcidGasic, Milica [0000-0003-0318-9147]
rioxxterms.typeOther
datacite.issupplementto.urlhttp://aclweb.org/anthology/P18-2069


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

Attribution 4.0 International (CC BY 4.0)
Except where otherwise noted, this item's licence is described as Attribution 4.0 International (CC BY 4.0)