An initial investigation of long-term adaptation for meeting transcription

Chen, X; Gales, MJF; Knill, K; Breslin, C; Chen, L; Chin, KK; Wan, V

An initial investigation of long-term adaptation for meeting transcription

Repository URI

https://www.repository.cam.ac.uk/handle/1810/247412

Files

Chen_et_al-2014-INTERSPEECH_2014.pdf (141.28 KB)

Type

Conference Object

Authors

Chen, X

Gales, MJF

Knill, Katherine

https://orcid.org/0000-0003-1292-2769

Breslin, C

Chen, L

Show 2 more

Abstract

Meeting transcription is a very useful and challenging task. The majority of research to date has focused on individual meeting, or only a small group of meetings. In many practical deployments, multiple related meetings will take place over a long period of time. This paper describes an initial investigation of how this long-term data can be used to improve meeting transcription. A corpus of technical meetings, using a single microphone array, was collected over a two year period, yielding a total of 179 hours of meeting data. Baseline systems using deep neural network acoustic models, in both Tandem and Hybrid configurations, and neural network-based language models are described. The impact of supervised and unsupervised adaptation of the acoustic models is then evaluated, as well as the impact of improved language models.

Keywords

Meeting Transcription, Unsupervised Adaptation, Confidence Score, MAP, MLLR

Journal Title

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Conference Name

Interspeech 2014

Journal ISSN

2308-457X
1990-9772

Publisher

ISCA

Publisher DOI

https://doi.org/10.21437/interspeech.2014-253

Rights

http://www.rioxx.net/licenses/all-rights-reserved

Sponsorship

Xie Chen would like to thank Toshiba Research Europe Ltd, Cambridge Research Lab, for funding his work. The authors would like to thank the Toshiba Cambridge Speech Group for allowing the data to be collected, also would like to thank Chao Zhang and Eric Wang for providing DNN and CMLLR transform tools.

Collections

Scholarly Works - Engineering
Symplectic mapped items for data match