Autoregressive clustering for HMM speech synthesis
Proceedings of the 11th Annual Conference of the International Speech Communication
International Conference on Spoken Language Processing, Interspeech 2010
Curran Associates, Inc.
MetadataShow full item record
Shannon, S., & Byrne, W. (2011). Autoregressive clustering for HMM speech synthesis. Proceedings of the 11th Annual Conference of the International Speech Communication, 829-832. http://www.interspeech2010.org/
The autoregressive HMM has been shown to provide efficient parameter estimation and high-quality synthesis, but in previous experiments decision trees derived from a non-autoregressive system were used. In this paper we investigate the use of autoregressive clustering for autoregressive HMM-based speech synthesis. We describe decision tree clustering for the autoregressive HMM and highlight differences to the standard clustering procedure. Subjective listening evaluation results suggest that autoregressive clustering improves the naturalness of the resulting speech. We find that the standard minimum description length (MDL) criterion for selecting model complexity is inappropriate for the autoregressive HMM. Investigating the effect of model complexity on naturalness, we find that a large degree of overfitting is tolerated without a substantial decrease in naturalness.
This research was funded by the European Community's Seventh Framework Programme (FP7/2007-2013), grant agreement 213845 (EMIME).
External link: http://www.interspeech2010.org/
This record's URL: http://www.dspace.cam.ac.uk/handle/1810/226374
Attribution 2.0 UK: England & Wales
Licence URL: http://creativecommons.org/licenses/by/2.0/uk/
Recommended or similar items
The following licence files are associated with this item: