Session: ASR Robustness (Feature Extraction, Acoustic Modeling and Adaptation)

Title: Speaker-Trained Recognition using Allophonic Enrollment Models

Authors: Vincent Vanhoucke, Michael Hochberg, Christopher Leggetter

Abstract: We introduce a method for performing speaker-trained recognition based on context-dependent allophone models from a large-vocabulary, speaker-independent recognition system. In this approach, a set of speaker-enrollment templates is selected from the context-dependent allophone models. These templates are used to build representations of the speaker-enrolled utterances. The advantages of this approach include improved performance and portability of the enrollments across different acoustic models.
We describe how the enrollment templates are selected and how they are applied to speaker-trained recognition. The approach has been evaluated on an over-the-telephone, voice-activated dialing task and shows significant performance improvements over techniques based on context-independent phone models or general acoustic model templates. In addition, porting enrollments from one model set to another is shown to result in almost no performance degradation.

a01vv101.ps a01vv101.pdf