Repository logo
 

Fundamental frequency estimation in speech signals with variable rate particle filters

Accepted version
Peer-reviewed

Repository DOI


Type

Article

Change log

Authors

Zhang, G 
Godsill, S 

Abstract

Fundamental frequency estimation, known as pitch estimation in speech signals is of interest both to the research community and to industry. Meanwhile, the particle filter is known to be a powerful Bayesian inference method to track dynamic parameters in nonlinear state-space models. In this paper, we propose a speech model under a time-varying source-filter speech model, and use variable rate particle filters (VRPF) to develop methods for estimation of pitch periods in speech signals. A Rao–Blackwellised variable rate particle filter (RBVRPF) is also implemented. The proposed VRPF and RBVRPF are compared with a state-of-the-art pitch estimation algorithm, the YIN algorithm. Simulation results show that more accurate estimation of pitch can be obtained by VRPF and RBVRPF even under strong background noise conditions.

Description

Keywords

variable rate particle filters, pitch estimation, Rao-Blackwellisation, source-filter model

Journal Title

IEEE/ACM Transactions on Audio Speech and Language Processing

Conference Name

Journal ISSN

2329-9290
2329-9304

Volume Title

24

Publisher

Institute of Electrical and Electronics Engineers (IEEE)
Sponsorship
Engineering and Physical Sciences Research Council (EP/K020153/1)
The authors would like to thank CSC Cambridge International Scholarship and Natural Science Foundation of China (No.61463035) for providing financial support.