Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis using two-pass decision tree construction

Gibson, M; Hirsimaki, T; Karhila, R; Kurimo, M; Byrne, WJ

Repository URI

http://www.dspace.cam.ac.uk/handle/1810/226329

Files

icassp2010-mgibson.pdf (71.41 KB)

Type

Conference Object

Authors

Gibson, M

Hirsimaki, T

Karhila, R

Kurimo, M

Byrne, WJ

Abstract

This paper demonstrates how unsupervised cross-lingual adaptation of HMM-based speech synthesis models may be performed without explicit knowledge of the adaptation data language. A two-pass decision tree construction technique is deployed for this purpose. Using parallel translated datasets, cross-lingual and intralingual adaptation are compared in a controlled manner. Listener evaluations reveal that the proposed method delivers performance approaching that of unsupervised intralingual adaptation.

Keywords

HMM-based speech synthesis, unsupervised speaker adaptation, cross-lingual

Journal Title

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing

Conference Name

IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '10

Publisher

IEEE

Publisher DOI

https://doi.org/10.1109/ICASSP.2010.5495196

Rights

http://www.rioxx.net/licenses/all-rights-reserved

Sponsorship

European Commission (213845)

Collections

Scholarly Works - Engineering - Information Engineering