Research data supporting "Semantically Conditioned LSTM-based Natural Language Generation for Spoken Dialogue Systems"
Change log
Authors
Description
This dataset is in JSON format and contains log files of interactions between a turn-taking spoken dialogue system and Amazon Mechanical Turk (AMT) workers, collected during our previous live trials. It covers two application domains, San Francisco restaurants and hotels, with around 1000 logs each. The user responses are the 1-best hypotheses recognised by our ASR system, and the system responses were collected by running another round of data collection on AMT. Around 5.1K system responses were collected in total for each domain. All users are anonymous.
This record supports a publication, which is available at http://mi.eng.cam.ac.uk/~thw28/papers/EMNLP15.pdf
Version
Software / Usage instructions
JSON
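Because the files are plain JSON, they can be read with any standard JSON parser. The following is a minimal Python sketch for a first look at one file. It assumes nothing about the internal schema, which this record does not describe, and only reports the type and size of the top-level container. The helper names `load_json` and `describe` are illustrative and are not part of the dataset.

```python
import json
from pathlib import Path


def load_json(path):
    """Parse one JSON file and return the decoded Python object."""
    with Path(path).open(encoding="utf-8") as handle:
        return json.load(handle)


def describe(data):
    """Return (type name, number of top-level entries) for decoded JSON.

    Lists and dicts report their length; any other value counts as one entry.
    """
    if isinstance(data, (list, dict)):
        return type(data).__name__, len(data)
    return type(data).__name__, 1
```

For example, `describe(load_json(path))`, where `path` points to one of the downloaded files, returns a pair such as the container type and its entry count, which is a reasonable starting point before inspecting individual records.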
Keywords
Publisher
University of Cambridge
Rights and licensing
Except where otherwise noted, this item's license is described as Attribution 2.0 UK: England & Wales
Sponsorship
This work was supported by Toshiba Research Europe, Cambridge Research Laboratory [grant number RG74649].