Research data supporting "Semantically Conditioned LSTM-based Natural Language Generation for Spoken Dialogue Systems"
University of Cambridge
MetadataShow full item record
Wen, T., Gasic, M., Mrksic, N., Su, P., Vandyke, D., & Young, S. (2015). Research data supporting "Semantically Conditioned LSTM-based Natural Language Generation for Spoken Dialogue Systems" [Dataset]. https://www.repository.cam.ac.uk/handle/1810/251304
This dataset is in JSON format and contains log files of interactions between a turn-taking spoken dialogue system and Amazon Mechanical turkers, collected from our previous live trials. It includes two application domains: San Francisco restaurants and hotels, each of them has around 1000 logs. The user responses are 1-best ASR hypothesis recognised by our ASR system, and the system responses were collected by running another round of data collection on AMT. The number of total collected system responses is around 5.1K for each domain. All users are anonymous.
This record supports publication and is available at http://mi.eng.cam.ac.uk/~thw28/papers/EMNLP15.pdf
natural language generation
Publication Reference: http://mi.eng.cam.ac.uk/~thw28/papers/EMNLP15.pdf
“This work was supported by the Toshiba Research Europe, Cambridge Research Laboratory [grant number RG74649].
This record's URL: https://www.repository.cam.ac.uk/handle/1810/251304
Attribution 2.0 UK: England & Wales
Licence URL: http://creativecommons.org/licenses/by/2.0/uk/
Recommended or similar items
The following licence files are associated with this item: