Recurrent neural network language models for keyword search

Chen, X; Ragni, A; Vasilakes, J; Liu, X; Knill, K; Gales, MJF

Recurrent neural network language models for keyword search

Accepted version

Peer-reviewed

Repository URI

https://www.repository.cam.ac.uk/handle/1810/267092

Repository DOI

https://doi.org/10.17863/CAM.13112

Files

Accepted version (88.07 KB)

Type

Conference Object

Authors

Chen, X

Ragni, A

Vasilakes, J

Liu, X

Knill, Katherine

https://orcid.org/0000-0003-1292-2769

Show 1 more

Abstract

Recurrent neural network language models (RNNLMs) have becoming increasingly popular in many applications such as automatic speech recognition (ASR). Significant performance improvements in both perplexity and word error rate over standard n-gram LMs have been widely reported on ASR tasks. In contrast, published research on using RNNLMs for keyword search systems has been relatively limited. In this paper the application of RNNLMs for the IARPA Babel keyword search task is investigated. In order to supplement the limited acoustic transcription data, large amounts of web texts are also used in large vocabulary design and LM training. Various training criteria were then explored to improved RNNLMs' efficiency in both training and evaluation. Significant and consistent improvements on both keyword search and ASR tasks were obtained across all languages.

Keywords

speech recognition, keyword search, language model, recurrent neural network

Journal Title

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Conference Name

The 42nd IEEE International Conference on Acoustics, Speech and Signal Processing

Journal ISSN

1520-6149

Publisher

IEEE

Publisher DOI

https://doi.org/10.1109/ICASSP.2017.7953263

Rights

http://www.rioxx.net/licenses/all-rights-reserved

Sponsorship

IARPA (4912046943)

Xie Chen is supported by Toshiba Research Europe Ltd, Cambridge Research Lab. This work was supported by the Intelligence Advanced Research Projects Activity (IARPA) via Department of Defense U. S. Army Research Laboratory (DoD/ARL) contract number W911NF-12-C-0012. The U. S. Government is authorized to reproduce and distribute reprints for Governmental purposes notwithstanding any copyright annotation thereon. Disclaimer: The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements, either expressed or implied, of IARPA, DoD/ARL, or the U. S. Government.

Collections

Scholarly Works - Engineering