Research data supporting "Semantically Conditioned LSTM-based Natural Language Generation for Spoken Dialogue Systems"
Change log
Authors
Wen, Tsung-Hsien https://orcid.org/0000-0002-7411-0478
Gasic, Milica
Mrksic, Nikola
Su, Pei-Hao
Vandyke, David
Description
This dataset is in JSON format and contains log files of interactions between a turn-taking spoken dialogue system and Amazon Mechanical turkers, collected from our previous live trials. It includes two application domains: San Francisco restaurants and hotels, each of them has around 1000 logs. The user responses are 1-best ASR hypothesis recognised by our ASR system, and the system responses were collected by running another round of data collection on AMT. The number of total collected system responses is around 5.1K for each domain. All users are anonymous.
This record supports publication and is available at http://mi.eng.cam.ac.uk/~thw28/papers/EMNLP15.pdf
Version
Software / Usage instructions
JSON
Keywords
natural language generation
Publisher
University of Cambridge
Sponsorship
“This work was supported by the Toshiba Research Europe, Cambridge Research Laboratory [grant number RG74649].