
Improving the Training and Evaluation Efficiency of Recurrent Neural Network Language Models


Abstract

Recurrent neural network language models (RNNLMs) are becoming increasingly popular for speech recognition. Previously, we have shown that RNNLMs with a full (non-classed) output layer (F-RNNLMs) can be trained efficiently on a GPU, giving a large reduction in training time over conventional class-based models (C-RNNLMs) trained on a standard CPU. However, since test-time RNNLM evaluation is often performed entirely on a CPU, standard F-RNNLMs are inefficient: the entire output layer must be computed for normalisation. In this paper, it is demonstrated that C-RNNLMs can be efficiently trained on a GPU using our spliced sentence bunch technique, which allows good CPU test-time performance ($42\times$ speedup over an F-RNNLM). Furthermore, the performance of different classing approaches is investigated. We also examine the use of variance regularisation of the softmax denominator for F-RNNLMs and show that it allows F-RNNLMs to be used efficiently in test ($56\times$ speedup on a CPU). Finally, the use of two GPUs for F-RNNLM training using pipelining is described and shown to reduce training time over a single GPU by a factor of $1.6\times$.
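As a brief illustrative sketch of two of the techniques named above (a standard formulation consistent with the abstract's description; the notation $\gamma$, $a_v(h)$ and $Z(h)$ is ours and may differ from the paper's), a C-RNNLM factorises the word probability through a class layer,

$P(w \mid h) = P\bigl(c(w) \mid h\bigr)\, P\bigl(w \mid c(w), h\bigr),$

so that test-time normalisation runs only over the class set and the words of one class rather than the full vocabulary. Variance regularisation instead augments the cross-entropy criterion with a penalty on the variance of the log softmax denominator,

$J(\theta) = -\frac{1}{N}\sum_{i=1}^{N} \log P(w_i \mid h_i) + \frac{\gamma}{2N}\sum_{i=1}^{N} \bigl(\log Z(h_i) - \overline{\log Z}\bigr)^2, \qquad Z(h) = \sum_{v \in \mathcal{V}} e^{a_v(h)},$

where $a_v(h)$ is the output-layer activation for word $v$ given history $h$ and $\overline{\log Z}$ denotes the mean log denominator. Because $\log Z(h)$ is driven towards a constant during training, the unnormalised score $e^{a_v(h)}$ can serve directly as a probability estimate at test time, avoiding the $O(|\mathcal{V}|)$ normalisation sum on the CPU.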

Description

Journal Title

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Conference Name

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Journal ISSN

1520-6149

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

Rights and licensing

Except where otherwise noted, this item's license is described as http://www.rioxx.net/licenses/all-rights-reserved

Sponsorship

Xie Chen is supported by Toshiba Research Europe Ltd, Cambridge Research Lab. The research leading to these results was also supported by EPSRC grant EP/I031022/1 (Natural Speech Technology) and by DARPA under the Broad Operational Language Translation (BOLT) and RATS programs. The paper does not necessarily reflect the position or policy of the US Government, and no official endorsement should be inferred.