Session: Large Vocabulary (Language Modeling and Speech Understanding)

Title: Improvement of Non-negative Matrix Factorization based Language Model using Exponential Models

Authors: Miroslav Novak, Richard Mammone

Abstract: This paper describes the use of exponential models to improve Non-negative Matrix Factorization (NMF) based topic language models for Automatic Speech Recognition. The NMF modeling technique borrows its basic idea from Latent Semantic Analysis (LSA), which is typically used in Information Retrieval. An improvement was achieved by using exponential models to estimate the a posteriori topic probabilities given an observed history. This method improved the perplexity of the NMF model, yielding an overall 24% perplexity improvement over a trigram language model.

a01mn091.ps a01mn091.pdf