Multi-domain neural network language generation for spoken dialogue systems

Accepted version
Peer-reviewed

Type

Article

Authors

Wen, TH 
Gašić, M 
Mrkšić, N 
Rojas-Barahona, LM 
Su, PH 

Abstract

Moving from limited-domain natural language generation (NLG) to open domain is difficult because the number of semantic input combinations grows exponentially with the number of domains. Therefore, it is important to leverage existing resources and exploit similarities between domains to facilitate domain adaptation. In this paper, we propose a procedure to train multi-domain, Recurrent Neural Network (RNN)-based language generators via multiple adaptation steps. In this procedure, a model is first trained on counterfeited data synthesised from an out-of-domain dataset, and then fine-tuned on a small set of in-domain utterances with a discriminative objective function. Corpus-based evaluation results show that the proposed procedure can achieve competitive performance in terms of BLEU score and slot error rate while significantly reducing the data needed to train generators in new, unseen domains. In subjective testing, human judges confirm that the procedure greatly improves generator performance when only a small amount of data is available in the domain.
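
The abstract outlines a two-stage adaptation recipe: pre-train the generator on counterfeited data synthesised from an out-of-domain corpus, then fine-tune on a small number of real in-domain utterances. The sketch below illustrates that recipe in PyTorch under loose assumptions: the RNNGenerator class, the toy_loader helper, and all hyperparameters are hypothetical stand-ins rather than the paper's model, and the discriminative fine-tuning objective is simplified to ordinary cross-entropy.

import torch
import torch.nn as nn

class RNNGenerator(nn.Module):
    # Toy RNN generator conditioned on a dialogue-act vector (hypothetical;
    # the paper's generator is an RNN-based NLG model, not this exact network).
    def __init__(self, vocab=1000, act_dim=32, hidden=128):
        super().__init__()
        self.embed = nn.Embedding(vocab, hidden)
        self.act_proj = nn.Linear(act_dim, hidden)
        self.rnn = nn.GRU(hidden, hidden, batch_first=True)
        self.out = nn.Linear(hidden, vocab)

    def forward(self, tokens, act):
        h0 = self.act_proj(act).unsqueeze(0)   # inject the semantic input
        y, _ = self.rnn(self.embed(tokens), h0)
        return self.out(y)

def toy_loader(n=64, seq_len=8, vocab=1000, act_dim=32):
    # Random stand-in data; real training pairs dialogue acts with utterances.
    ds = torch.utils.data.TensorDataset(
        torch.randint(0, vocab, (n, seq_len)),   # input tokens
        torch.randn(n, act_dim),                 # dialogue-act vectors
        torch.randint(0, vocab, (n, seq_len)))   # target tokens
    return torch.utils.data.DataLoader(ds, batch_size=16)

def train(model, loader, epochs, lr):
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    loss_fn = nn.CrossEntropyLoss()
    for _ in range(epochs):
        for tokens, act, targets in loader:
            logits = model(tokens, act)
            loss = loss_fn(logits.reshape(-1, logits.size(-1)),
                           targets.reshape(-1))
            opt.zero_grad()
            loss.backward()
            opt.step()

model = RNNGenerator()
# Stage 1: train on counterfeited data synthesised from an out-of-domain corpus.
train(model, toy_loader(), epochs=10, lr=1e-3)
# Stage 2: fine-tune on the small set of real in-domain utterances
# (the paper uses a discriminative objective here; plain cross-entropy shown).
train(model, toy_loader(), epochs=5, lr=1e-4)

The two train calls mirror the "multiple adaptation steps" of the abstract: the same parameters are carried from the counterfeit-data stage into the in-domain stage, which is what lets a small amount of in-domain data suffice.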

Keywords

cs.CL

Journal Title

2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2016 - Proceedings of the Conference

Publisher

Association for Computational Linguistics

Sponsorship

Toshiba Research Europe Ltd.