Research data supporting "On-line Active Reward Learning for Policy Optimisation in Spoken Dialogue Systems"
Change log
Authors
Description
This repository contains the data presented in the paper "On-line Active Reward Learning for Policy Optimisation in Spoken Dialogue Systems" (ACL 2016). Two separate datasets, described in section 4 of the paper, are provided:

1. DialogueEmbedding/ contains the [train|valid|test] data for unsupervised dialogue embedding creation, each split consisting of *.[feature|reward|turn|subjsuc] files. Each line of *.turn gives the number of lines to read for the corresponding dialogue in *.[feature|reward|subjsuc], and *.subjsuc holds the user's subjective success rating. The feature size is 74.

2. DialoguePolicy/ contains four contrasting systems with different reward models: [GP|RNN|ObjSubj|Subj]. Each system directory holds the data obtained in interaction with Amazon Mechanical Turk users while training three policies with the same configuration, policy_[1|2|3]/, together with a .csv file of the evaluation results recorded during training. Each policy_[1|2|3]/ directory contains a list of calls, time-stamped in the directory name, each holding a session.xml file with the dialogue log and a feedback.xml file with the user feedback.
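As a minimal sketch of how the DialogueEmbedding files fit together, the loader below groups the flat per-turn lines of a *.feature file into per-dialogue lists using the line counts in *.turn. The exact on-disk format (whitespace-separated floats, one integer per line in *.turn) is an assumption, not documented above; file names and the helper function are hypothetical.

```python
import os
import tempfile
from pathlib import Path

FEATURE_SIZE = 74  # per the dataset description above


def load_dialogues(prefix):
    """Group flat per-turn feature lines into per-dialogue lists.

    Assumes each line of <prefix>.turn holds one integer: the number of
    lines in <prefix>.feature belonging to that dialogue (an assumption
    about the format, consistent with the description above).
    """
    turns = [int(t) for t in Path(prefix + ".turn").read_text().split()]
    feats = [[float(x) for x in line.split()]
             for line in Path(prefix + ".feature").read_text().splitlines()]
    dialogues, start = [], 0
    for n in turns:
        dialogues.append(feats[start:start + n])
        start += n
    return dialogues


# Demo on tiny synthetic files: two dialogues of 2 and 1 turns.
tmp = tempfile.mkdtemp()
prefix = os.path.join(tmp, "train")
with open(prefix + ".turn", "w") as f:
    f.write("2\n1\n")
with open(prefix + ".feature", "w") as f:
    for _ in range(3):
        f.write(" ".join(["0.0"] * FEATURE_SIZE) + "\n")

dialogues = load_dialogues(prefix)
```

The same *.turn offsets would apply to *.reward and *.subjsuc, since the description states all three are indexed by the same line counts.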
This research data supports "On-line Active Reward Learning for Policy Optimisation in Spoken Dialogue Systems", published in the Proceedings of the Association for Computational Linguistics (ACL).