610 - Speech formant estimation using Hilbert-Huang transform
Huang H., Pan J.
Abstract
This study proposes a speech formant frequency estimation method based on the Hilbert-Huang Transform (HHT). The speech data are first filtered with band-pass filters whose centre frequencies are obtained from FFT analysis, and are then decomposed into a set of Intrinsic Mode Functions (IMFs) using the HHT analysis method. The IMFs containing formant frequencies are identified according to a maximum-energy criterion, their instantaneous frequencies and Hilbert spectra are calculated, and the formant frequencies of the speech data are then determined efficiently. The results show that, compared with conventional formant estimation methods, the HHT-based method not only describes the non-linear and non-stationary characteristics of speech signals more clearly, but also gives the speech formant frequencies and their variations with high time-frequency resolution and accuracy.
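The last two steps described above (selecting the component with maximum energy, then reading a frequency from its Hilbert-based instantaneous frequency) can be illustrated with a minimal sketch. It is not the authors' implementation: the band-pass filtering and the empirical mode decomposition are skipped, three synthetic sinusoids stand in for IMFs, and the function names, sampling rate and test frequencies are all assumptions chosen for illustration. Taking the median of the instantaneous frequency as the estimate is likewise an illustrative choice, not something stated in the abstract.

```python
import numpy as np


def analytic_signal(x):
    """Analytic signal of a real sequence, built in the frequency domain.

    Negative-frequency bins are zeroed and positive-frequency bins doubled,
    which is the usual discrete Hilbert-transform construction.
    """
    n = len(x)
    spectrum = np.fft.fft(x)
    weights = np.zeros(n)
    weights[0] = 1.0
    if n % 2 == 0:
        weights[n // 2] = 1.0          # Nyquist bin kept once
        weights[1:n // 2] = 2.0
    else:
        weights[1:(n + 1) // 2] = 2.0
    return np.fft.ifft(spectrum * weights)


def instantaneous_frequency(component, fs):
    """Instantaneous frequency in Hz from the unwrapped analytic phase."""
    phase = np.unwrap(np.angle(analytic_signal(component)))
    return np.diff(phase) * fs / (2.0 * np.pi)


def pick_max_energy(components):
    """Index of the component with the largest sum of squared samples."""
    energies = [float(np.sum(np.square(c))) for c in components]
    return int(np.argmax(energies))


# Assumption: synthetic stand-ins for IMFs, not real speech or EMD output.
fs = 8000.0                              # hypothetical sampling rate, Hz
t = np.arange(2000) / fs
components = [
    0.2 * np.sin(2 * np.pi * 2300.0 * t),
    1.0 * np.sin(2 * np.pi * 700.0 * t),  # dominant, formant-like component
    0.1 * np.sin(2 * np.pi * 120.0 * t),
]

best = pick_max_energy(components)
inst_f = instantaneous_frequency(components[best], fs)
formant_estimate = float(np.median(inst_f))  # illustrative summary statistic
```

With real speech, the components would come from an empirical mode decomposition of the band-pass filtered signal, and the instantaneous frequency would vary over time rather than sit at a constant value.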
Citation
Huang H.; Pan J.: Speech formant estimation using Hilbert-Huang transform, CD-ROM Proceedings of the Thirteenth International Congress on Sound and Vibration (ICSV13), July 2-6, 2006, Vienna, Austria, Eds.: Eberhardsteiner, J.; Mang, H.A.; Waubke, H., Publisher: Vienna University of Technology, Austria, ISBN: 3-9501554-5-7