Research Data Supporting "Multi-domain Neural Network Language Generation for Spoken Dialogue Systems"
R.-Barahona, Lina M.
University of Cambridge
MetadataShow full item record
Tsung-Hsien, W., Gasic, M., Mrksic, N., R.-Barahona, L. M., Pei-Hao, S., Vandyke, D., & Young, S. (2016). Research Data Supporting "Multi-domain Neural Network Language Generation for Spoken Dialogue Systems" [Dataset]. https://www.repository.cam.ac.uk/handle/1810/255930
This is a natural language generation dataset collected from Amazon Mechanical Turk used in this paper "Multi-domain Neural Network Language Generation for Spoken Dialogue Systems" in NAACL-HLT 2016. It contains two domains regarding to consumer electronics: laptop and TV. Each file is in JSON format and contains a list of tuples. The three elements are dialogue act (semantic representation), sentence generated from AMT workers, sentence generated from our handcrafted template generator. There are 13K and 7K distinct DA and sentence pairs in laptop and TV domain, respectively. All products are anonymous.
NLG, natural language generation, dialogue system
Publication Reference: http://www.aclweb.org/anthology/N16-1015
Toshiba Research Europe Ltd, Cambridge Research Laboratory
This record's URL: https://www.repository.cam.ac.uk/handle/1810/255930
Attribution 2.0 UK: England & Wales
Licence URL: http://creativecommons.org/licenses/by/2.0/uk/
Recommended or similar items
The following licence files are associated with this item: