Authors IndexSessionsTechnical programAttendees

 

Session: Wireless ASR, Distributed Speech Recognition and Hands Free Interaction

Title: JOINT ESTIMATION OF NOISE AND CHANNEL DISTORTION IN A GENERALIZED EM FRAMEWORK

Authors: Trausti Kristjansson, Brendan Frey, Li Deng, Alex Acero

Abstract: The performance of speech cleaning and noise adaptation algorithms is heavily dependent on the quality of the noise and channel models. Various strategies have been proposed in the literature for adapting to the current noise and channel conditions. In this paper, we describe the joint learning of noise and channel distortion in a novel framework called ALGONQUIN. The learning algorithm employs a generalized EM strategy wherein the E step is approximate. We discuss the characteristics of the new algorithm, with a focus on convergence rates and parameter initialization. We show that the learning algorithm can successfully disentangle the non-linear effects of a noise and linear effects of the channel, and achieve recognition results similar to those obtained when the noise and channel models are known.

a01tk134.ps a01tk134.pdf