Phoneme and Sub-Phoneme T-Normalization for Text-Dependent Speaker Recognition.

Doroteo T. Toledano, Cristina Esteve-Elizalde, Joaquin Gonzalez-Rodriguez, Ruben Fernandez Pozo and Luis Hernandez Gomez.

Abstract 

Test normalization (T-Norm) is a score normalization technique that is regularly and successfully applied in the context of text-independent speaker recognition. It is less frequently applied, however, to text-dependent or text-prompted speaker recognition, mainly because its improvement in this context is more modest. In this paper we present a novel way to improve the performance of T-Norm for text-dependent systems. It consists in applying score T-Normalization at the phoneme or sub-phoneme level instead of at the sentence level. Experiments on the YOHO corpus show that, while using standard sentence-level T-Norm does not improve equal error rate (EER), phoneme and sub-phoneme level T-Norm produce a relative EER reduction of 18.9% and 20.1% respectively on a state-of-the-art HMM based text-dependent speaker recognition system. Results are even better for working points with low false acceptance rates.....

full text

Winelands1
Winelands2
1399019 vinyard
Winelands3

BuiltWithNOF

ABSTRACTS

sun-small
spescom1
Resize of Resize of ISCA_logo2