Speaker adaptation and adaptive training for jointly optimised tandem systems
View / Open Files
Authors
Wang, Y
Zhang, C
Gales, MJF
Woodland, PC
Publication Date
2018Journal Title
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Conference Name
Interspeech 2018
ISSN
2308-457X
ISBN
978-1-5108-7221-9
Publisher
ISCA
Volume
2018-September
Pages
872-876
Type
Conference Object
Metadata
Show full item recordCitation
Wang, Y., Zhang, C., Gales, M., & Woodland, P. (2018). Speaker adaptation and adaptive training for jointly optimised tandem systems. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2018-September 872-876. https://doi.org/10.21437/Interspeech.2018-2432
Abstract
Speaker independent (SI) Tandem systems trained by joint optimisation
of bottleneck (BN) deep neural networks (DNNs) and
Gaussian mixture models (GMMs) have been found to produce
similar word error rates (WERs) to Hybrid DNN systems. A
key advantage of using GMMs is that existing speaker adaptation
methods, such as maximum likelihood linear regression
(MLLR), can be used which to account for diverse speaker
variations and improve system robustness. This paper investigates
speaker adaptation and adaptive training (SAT) schemes
for jointly optimised Tandem systems. Adaptation techniques
investigated include constrained MLLR (CMLLR) transforms
based on BN features for SAT as well as MLLR and parameterised
sigmoid functions for unsupervised test-time adaptation.
Experiments using English multi-genre broadcast (MGB3) data
show that CMLLR SAT yields a 4% relative WER reduction
over jointly trained Tandem and Hybrid SI systems, and further
reductions in WER are obtained by system combination.
Keywords
Speech recognition, Tandem system, joint training, speaker adaptive training
Identifiers
External DOI: https://doi.org/10.21437/Interspeech.2018-2432
This record's URL: https://www.repository.cam.ac.uk/handle/1810/286180
Rights
Licence:
http://www.rioxx.net/licenses/all-rights-reserved
Statistics
Total file downloads (since January 2020). For more information on metrics see the
IRUS guide.
Recommended or similar items
The current recommendation prototype on the Apollo Repository will be turned off on 03 February 2023. Although the pilot has been fruitful for both parties, the service provider IKVA is focusing on horizon scanning products and so the recommender service can no longer be supported. We recognise the importance of recommender services in supporting research discovery and are evaluating offerings from other service providers. If you would like to offer feedback on this decision please contact us on: support@repository.cam.ac.uk