SpacerHome

Spacer
Mirror Sites
Spacer
General Information
Spacer
Confernce Schedule
Spacer
Technical Program
Spacer
     Plenary Sessions
Spacer
     Special Sessions
Spacer
     Expert Summaries
Spacer
     Tutorials
Spacer
     Industry Technology Tracks
Spacer
     Technical Sessions
Spacer
Tutorials
Spacer
Industry Technology Tracks
Spacer
Exhibits
Spacer
Sponsors
Spacer
Registration
Spacer
Coming to Phoenix
Spacer
Call for Papers
Spacer
Author's Kit
Spacer
On-line Review
Spacer
Future Conferences
Spacer
Help

Afternoon Tutorial 4

Conference Logo

Robust Automatic Speech Recognition for Unknown Environment Compensation

Jean-Claude Junqua
Speech Technology Laboratory
 

This tutorial summarizes the main difficulties encountered in Automatic Speech Recognition (ASR) when the type of communication channel is not known. This problem is crucial for the development of successful applications in promising domains such as computer telephony and cars. The main technical problems encountered are due to the speaker (e.g. speaking style, Lombard reflex, vocal tract geometry), the use of microphones with different degrees of spectral slope, the variable quality of the support channels (e.g. telephone channels are noisy and have different characteristics), the reverberation and echoes, the variable distance and direction to the microphone introduced by hands-free recognition, and the ambient noise which distorts the input speech signals.

This tutorial will present these distortions along with their influence on automatic speech recognition. It will also present solutions currently in use in state-of-the-art ASR systems including feature and model-based compensation along with speaker adaptive techniques. Finally, it will highlight some promising directions (e.g. characterization of the recognizer sensitivity to the source of distortion, use of a priori knowledge) for future research.

About the Tutorial Speaker

Jean-Claude Junqua received his Engineer degree (1980) from ENSEM (France) in Electronics and Automation, his Master and Doctorate degrees (in 1981 and 1989, respectively) and the "Habilitation a diriger des recherches" (1993) from the University of Nancy I (France) in the field of Computer Science. From 1981 to 1986 he was responsible of the computer facilities of CRIN (Research Center in Computer Science of Nancy, France). From 1987 to 1988 he was visiting researcher at Speech Technology Laboratory in Santa Barbara, California. In 1989, he joined Speech Technology Laboratory. From April 1992 to August 1993 he was visiting researcher at Matsushita, Osaka, Japan. Currently heading the speech recognition group and Group Director at Speech Technology Laboratory, his work has concentrated on improving robustness of automatic speech recognizers for small to medium size vocabularies. His current interests cover all aspects of automatic speech recognition, e.g. the study of noisy Lombard and channel distorted speech recognition, context-dependent phone modeling, and the design of dialogue systems. He is the author of more than 100 articles or patents in the above areas along with a recent book entitled "Robustness in Automatic Speech Recognition". He served as a chairman at several International conferences and participated to various International scientific committees. In 1992 and 1997 he co-organized two ESCA workshops entitled, respectively, "Speech Processing in Adverse Conditions" and "Robust Speech Recognition for Unknown Communication Channels". He is currently in the Editorial Board of the Speech Communication Journal, Associate Editor of the IEEE Transactions on Speech and Audio Processing, and member of IEEE and ESCA.

Return to overview of Tutorials.


Last Update:  February 4, 1999         Ingo Höntsch
Return to Top of Page