Invited Talk: ROLES OF HIGH-FIDELITY ACOUSTIC MODELING IN ROBUST SPEECH RECOGNITION
Li Deng, Microsoft Research, United States
S1.1: INTERPOLATION OF LOST SPEECH SEGMENTS USING LP-HNM MODEL WITH CODEBOOK-MAPPING POST-PROCESSING
Esfandiar Zavarehei, Saeed Vaseghi, Brunel University, United Kingdom
S1.2: SPEECH ENHANCEMENT USING PCA AND VARIANCE OF THE RECONSTRUCTION ERROR IN DISTRIBUTED SPEECH RECOGNITION
Amin Haji Abolhassani, INRS-Energie-Materiaux-Telecommunications, Canada; Sid-Ahmed Selouani, Universite de Moncton, Campus de Shippagan, Canada; Douglas O'Shaughnessy, INRS-Energie-Materiaux-Telecommunications, Canada
S1.3: DEVELOPMENT OF A PHONETIC SYSTEM FOR LARGE VOCABULARY ARABIC SPEECH RECOGNITION
Mark Gales, Frank Diehl, Chandra Raut, Marcus Tomalin, Phil Woodland, Kai Yu, Cambridge University, United Kingdom
S1.4: FACTOR ANALYSIS OF ACOUSTIC FEATURES FOR STREAMED HIDDEN MARKOV MODELS
Chuan-Wei Ting, Jen-Tzung Chien, National Cheng Kung University, Taiwan
S1.5: MONOLINGUAL AND CROSSLINGUAL COMPARISON OF TANDEM FEATURES DERIVED FROM ARTICULATORY AND PHONE MLPS
Özgür Çetin, Yahoo!, Inc., United States; Mathew Magimai-Doss, IDIAP Research Institute, Switzerland; Karen Livescu, Massachusetts Institute of Technology, United States; Arthur Kantor, University of Illinois, Urbana-Champaign, United States; Simon King, University of Edinburgh, United Kingdom; Chris Bartels, University of Washington, United States; Joe Frankel, University of Edinburgh, United Kingdom
S1.6: INCORPORATING THE VOICING INFORMATION INTO HMM-BASED AUTOMATIC SPEECH RECOGNITION
Peter Jancovic, Münevver Köküer, University of Birmingham, United Kingdom
S1.7: EXPLOITING COMPLEMENTARY ASPECTS OF PHONOLOGICAL FEATURES IN AUTOMATIC SPEECH RECOGNITION
Parya Momayyez, James Waterhouse, Richard Rose, McGill University, Canada
S1.8: ROBUST SPEECH RECOGNITION USING NOISE SUPPRESSION BASED ON MULTIPLE COMPOSITE MODELS AND MULTI-PASS SEARCH
Takatoshi Jitsuhiro, Tomoji Toriyama, Kiyoshi Kogure, ATR Knowledge Science Laboratories, Japan
S1.9: PREDICTIVE LINEAR TRANSFORMS FOR NOISE ROBUST SPEECH RECOGNITION
Mark Gales, Rogier van Dalen, Cambridge University, United Kingdom
S1.10: HIGH-PERFORMANCE HMM ADAPTATION WITH JOINT COMPENSATION OF ADDITIVE AND CONVOLUTIVE DISTORTIONS VIA VECTOR TAYLOR SERIES
Jinyu Li, Li Deng, Dong Yu, Yifan Gong, Alex Acero, Microsoft Corporation, United States
S1.11: MINIMUM MUTUAL INFORMATION BEAMFORMING FOR SIMULTANEOUS ACTIVE SPEAKERS
Kenichi Kumatani, University of Karlsruhe, Germany / IDIAP, Switzerland; Uwe Mayer, Tobias Gehrig, Emilian Stoimenov, University of Karlsruhe, Germany; John McDonough, University of Karlsruhe / Saarland University, Germany; Matthias Wölfel, University of Karlsruhe, Germany
S1.12: TWO EXTENSIONS TO ENSEMBLE SPEAKER AND SPEAKING ENVIRONMENT MODELING FOR ROBUST AUTOMATIC SPEECH RECOGNITION
Yu Tsao, Chin-Hui Lee, Georgia Institute of Technology, United States
S1.13: MODULATION SPECTRUM EQUALIZATION FOR ROBUST SPEECH RECOGNITION
Liang-che Sun, Chang-wen Hsu, Lin-shan Lee, Graduate Institute of Communication Engineering, National Taiwan University, Taiwan
S1.14: INVESTIGATING THE USE OF SPEECH FEATURES AND THEIR CORRESPONDING DISTRIBUTION CHARACTERISTICS FOR ROBUST SPEECH RECOGNITION
Shih-Hsiang Lin, Yao-Ming Yeh, Berlin Chen, National Taiwan Normal University, Taiwan
S1.15: JOINT DECODING OF MULTIPLE SPEECH PATTERNS FOR ROBUST SPEECH RECOGNITION
Nishanth Ulhas Nair, T. V. Sreenivas, Indian Institute of Science, India
S1.16: ROBUST SPEECH RECOGNITION BY PROPERLY UTILIZING RELIABLE FRAMES AND SEGMENTS IN CORRUPTED SIGNALS
Yi Chen, Chia-yu Wan, Lin-shan Lee, Graduate Institute of Communication Engineering, National Taiwan University, Taiwan
S1.17: ROBUST SPEECH RECOGNITION WITH ON-LINE UNSUPERVISED ACOUSTIC FEATURE COMPENSATION
Luis Buera, Antonio Miguel, Eduardo Lleida, Oscar Saz, Alfonso Ortega, University of Zaragoza, Spain
S1.18: DESIGN AND IMPLEMENTATION OF A ROBOT AUDITION SYSTEM FOR AUTOMATIC SPEECH RECOGNITION OF SIMULTANEOUS SPEECH
Shun'ichi Yamamoto, Kyoto University, Japan; Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, HRI-JP, Japan; Jean-Marc Valin, CSIRO, Australia; Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno, Kyoto University, Japan
S1.19: EXPERIMENTS ON CROSS-SYSTEM ACOUSTIC MODEL ADAPTATION
Diego Giuliani, Fabio Brugnara, Fondazione Bruno Kessler, Italy
Panel Discussion
Organizer: Hynek Hermansky
Panelists: Li Deng, Jeff Bilmes, Jordan Cohen, Ralf Schlueter, Herve Bourlard
Invited Talk: VOICE SEARCH – INFORMATION ACCESS VIA VOICE QUERIES
Ye-Yi Wang, Microsoft Research, United States
S2.1: HIERARCHICAL PITMAN-YOR LANGUAGE MODELS FOR ASR IN MEETINGS
Songfang Huang, Steve Renals, University of Edinburgh, United Kingdom
S2.2: ADAPTING GRAPHEME-TO-PHONEME CONVERSION FOR NAME RECOGNITION
Xiao Li, Asela Gunawardana, Alex Acero, Microsoft Research, United States
S2.3: GENERALIZED LINEAR INTERPOLATION OF LANGUAGE MODELS
Bo-June (Paul) Hsu, Massachusetts Institute of Technology, United States
S2.4: REFINE BIGRAM PLSA MODEL BY ASSIGNING LATENT TOPICS UNEVENLY
Jiazhong Nie, Runxin Li, Dingsheng Luo, Xihong Wu, Peking University, China
S2.5: EMPIRICAL STUDY OF THE NEURAL PROBABILISTIC LANGUAGE MODEL FOR ARABIC SPEECH RECOGNITION
Ahmad Emami, Lidia Mangu, IBM T. J. Watson Research Center, United States
S2.6: DISCRIMINATIVE LANGUAGE MODEL ADAPTATION FOR MANDARIN BROADCAST SPEECH TRANSCRIPTION AND TRANSLATION
Xunying Liu, Bill Byrne, Mark Gales, Adria de Gispert, Marcus Tomalin, Phil Woodland, Kai Yu, Cambridge University, United Kingdom
S2.7: RERANKING MACHINE TRANSLATION HYPOTHESES WITH STRUCTURED AND WEB-BASED LANGUAGE MODELS
Wen Wang, Andreas Stolcke, Jing Zheng, SRI International, United States
S2.8: DYNAMIC LANGUAGE MODELING FOR A DAILY BROADCAST NEWS TRANSCRIPTION SYSTEM
Ciro Martins, Aveiro University / L2F – Spoken Language Systems Lab – INESC-ID/IST, Portugal; António Teixeira, Aveiro University, Portugal; João Neto, L2F – Spoken Language Systems Lab – INESC-ID/IST, Portugal
S2.9: INVESTIGATING LINGUISTIC KNOWLEDGE IN A MAXIMUM ENTROPY TOKEN-BASED LANGUAGE MODEL
Jia Cui, Yi Su, Keith Hall, Frederick Jelinek, Center for Language and Speech Processing, United States
S2.10: ROBUST TOPIC INFERENCE FOR LATENT SEMANTIC LANGUAGE MODEL ADAPTATION
Aaron Heidel, Lin-shan Lee, National Taiwan University, Taiwan
S2.11: SPOKEN LANGUAGE UNDERSTANDING WITH KERNELS FOR SYNTACTIC/SEMANTIC STRUCTURES
Alessandro Moschitti, Giuseppe Riccardi, Christian Raymond, University of Trento, Italy
S2.12: SPOKEN DOCUMENT SUMMARIZATION USING RELEVANT INFORMATION
Yi-Ting Chen, Academia Sinica, Taiwan; Shih-Hsiang Lin, National Taiwan Normal University, Taiwan; Hsin-Min Wang, Academia Sinica, Taiwan; Berlin Chen, National Taiwan Normal University, Taiwan
S2.13: IMPROVING LECTURE SPEECH SUMMARIZATION USING RHETORICAL INFORMATION
Jian Zhang, Ho Yin Chan, Pascale Fung, Hong Kong University of Science and Technology, Hong Kong SAR of China
S2.14: AUTOMATIC DETECTION OF CONTRASTIVE ELEMENTS IN SPONTANEOUS SPEECH
Ani Nenkova, University of Pennsylvania, United States; Dan Jurafsky, Stanford University, United States
S2.15: CALL CLASSIFICATION FOR AUTOMATED TROUBLESHOOTING ON LARGE CORPORA
Keelan Evanini, University of Pennsylvania, United States; David Suendermann, Roberto Pieraccini, SpeechCycle, United States
S2.16: MAXIMUM ENTROPY MODEL PARAMETERIZATION WITH TF*IDF WEIGHTED VECTOR SPACE MODEL
Ye-Yi Wang, Alex Acero, Microsoft Research, United States
S2.17: A LANGUAGE MODELING APPROACH TO QUESTION ANSWERING ON SPEECH TRANSCRIPTS
Matthias H. Heie, Edward W. D. Whittaker, Josef R. Novak, Sadaoki Furui, Tokyo Institute of Technology, Japan
S2.18: AUTOMATIC LEXICAL PRONUNCIATIONS GENERATION AND UPDATE
Ghinwa Choueiter, Stephanie Seneff, James Glass, Massachusetts Institute of Technology, United States
S2.19: NON-NATIVE PRONUNCIATION VARIATION MODELING USING AN INDIRECT DATA-DRIVEN METHOD
Mina Kim, Yoo Rhee Oh, Hong Kook Kim, Gwangju Institute of Science and Technology (GIST), Republic of Korea
Panel Discussion
Organizer: Helen Meng
Panelists: Giuseppe Riccardi, Gary Geunbae Lee, Michiel Bacchiani, Tatsuya Kawahara, Ye-yi Wang
Invited Talk: SUBMODULARITY AND ADAPTATION
Jeff Bilmes, University of Washington, United States
S3.1: AGGLOMERATIVE INFORMATION BOTTLENECK FOR SPEAKER DIARIZATION OF MEETINGS DATA
Deepu Vijayasenan, Fabio Valente, Herve Boulard, IDIAP Research Institute, Switzerland
S3.2: EFFICIENT COMBINATION OF PARAMETRIC SPACES, MODELS AND METRICS FOR SPEAKER DIARIZATION - A MAXIMUM ENTROPY APPROACH
Themos Stafylakis, Vassilis Katsouros, George Carayannis, Athena - Research and Innovation Center in Information, Communication and Knowledge Technologies, Greece
S3.3: ROBUST SPEAKER CLUSTERING STRATEGIES TO DATA SOURCE VARIATION FOR IMPROVED SPEAKER DIARIZATION
Kyu Han, Samuel Kim, Shrikanth Narayanan, University of Southern California, United States
S3.4: A STUDY ON SOFT MARGIN ESTIMATION FOR LVCSR
Jinyu Li, Georgia Institute of Technology, United States; Zhijie Yan, University of Science and Technology of China, China; Chin-Hui Lee, Georgia Institute of Technology, United States; Ren-Hua Wang, University of Science and Technology of China, China
S3.5: HIERARCHICAL LARGE-MARGIN GAUSSIAN MIXTURE MODELS FOR PHONETIC CLASSIFICATION
Hung-An Chang, James Glass, Massachusetts Institute of Technology, United States
S3.6: AUTOMATIC SPEECH RECOGNITION BASED ON WEIGHTED MINIMUM CLASSIFICATION ERROR (W-MCE) TRAINING METHOD
Qiang Fu, Biing-Hwang Juang, Georgia Institute of Technology, United States
S3.7: TRAINING DATA SELECTION FOR IMPROVING DISCRIMINATIVE TRAINING OF ACOUSTIC MODELS
Shih-Hung Liu, Fang-Hui Chu, Shih-Hsiang Lin, Hung-Shin Lee, Berlin Chen, National Taiwan Normal University, Taiwan
S3.8: A CONSTRAINED LINE SEARCH APPROACH TO GENERAL DISCRIMINATIVE HMM TRAINING
Peng Liu, Microsoft Research Asia, China; Cong Liu, University of Science and Technology of China, China; Hui Jiang, York University, China; Frank Soong, Ren-Hua Wang, Microsoft Research Asia, China
S3.9: MIXTURE GAUSSIAN HMM-TRAJCTORY METHOD USING LIKELIHOOD COMPENSATION
Yasuhiro Minami, NTT Corporation, Japan
S3.10: STATE-DEPENDENT MIXTURE TYING WITH VARIABLE CODEBOOK SIZE FOR ACCENTED SPEECH RECOGNITION
Yi Liu, Fang Zheng, Center for Speech and Language Technologies, China; Lei He, Toshiba (China) Research and Development Center, China; Yunqing Xia, Center for Speech and Language Technologies, China
S3.11: BROAD PHONETIC CLASS RECOGNITION IN A HIDDEN MARKOV MODEL FRAMEWORK USING EXTENDED BAUM-WELCH TRANSFORMATIONS
Tara Sainath, Massachusetts Institute of Technology, United States; Dimitri Kanevsky, Bhuvana Ramabhadran, IBM T. J. Watson Research Center, United States
S3.12: A COMPACT SEMIDEFINITE PROGRAMMING (SDP) FORMULATION FOR LARGE MARGIN ESTIMATION OF HMMS IN SPEECH RECOGNITION
Yan Yin, Hui Jiang, York University, Canada
S3.13: HMM TRAINING BASED ON CV-EM AND CV GAUSSIAN MIXTURE OPTIMIZATION
Takahiro Shinozaki, Tatsuya Kawahara, Kyoto University, Japan
S3.14: VARIATIONAL KULLBACK-LEIBLER DIVERGENCE FOR HIDDEN MARKOV MODELS
John Hershey, Peder Olsen, Steven Rennie, IBM T. J. Watson Research Center, United States
S3.15: BAYESIAN ADAPTATION IN HMM TRAINING AND DECODING USING A MIXTURE OF FEATURE TRANSFORMS
Stavros Tsakalidis, Spyros Matsoukas, BBN Technologies, United States
S3.16: USE OF SYLLABLE NUCLEI LOCATIONS TO IMPROVE ASR
Chris Bartels, Jeff Bilmes, University of Washington, United States
S3.17: SPEECH RECOGNITION WITH LOCALIZED TIME-FREQUENCY PATTERN DETECTORS
Ken Schutte, James Glass, MIT Computer Science and Artificial Intelligence Laboratory, United States
S3.18: REGULARIZATION, ADAPTATION, AND NON-INDEPENDENT FEATURES IMPROVE HIDDEN CONDITIONAL RANDOM FIELDS FOR PHONE CLASSIFICATION
Yun-Hsuan Sung, Constantinos Boulis, Christopher Manning, Dan Jurafsky, Stanford University, United States
S3.19: DISCRIMINATIVE TRAINING OF MULTI-STATE BARGE-IN MODELS
Andrej Ljolje, Vincent Goffin, AT&T Labs - Research, United States
S3.20: GRAPH-BASED LEARNING FOR PHONETIC CLASSIFICATION
Andrei Alexandrescu, Katrin Kirchhoff, University of Washington, United States
Panel Discussion
Organizers: Jeff Bilmes, Tomoko Matsui
Panelists: Mark Gales, Fred Juang, Andrease Stolcke
Invited Talk: SPEECH-TRANSLATION: FROM DOMAIN-LIMITED TO DOMAIN-UNLIMITED TRANSLATION TASKS
Stephan Vogel, Carnegie Mellon University, United States
S4.1: CONSOLIDATION BASED SPEECH TRANSLATION
Chiori Hori, NiCT-ATR, Japan; Bing Zhao, Stephan Vogel, Alex Waibel, Carnegie Mellon University, United States
S4.2: LATTICE-BASED VITERBI DECODING TECHNIQUES FOR SPEECH TRANSLATION
George Saon, Michael Picheny, IBM T. J. Watson Research Center, United States
S4.3: SEMANTIC TRANSLATION ERROR RATE FOR EVALUATING TRANSLATION SYSTEMS
Krishna Subramanian, Dave Stallard, Rohit Prasad, Shirin Saleem, Prem Natarajan, BBN Technologies, United States
S4.4: THE RWTH ARABIC-TO-ENGLISH SPOKEN LANGUAGE TRANSLATION SYSTEM
Oliver Bender, Evgeny Matusov, Stefan Hahn, Sasa Hasan, Shahram Khadivi, Hermann Ney, RWTH Aachen University, Germany
S4.5: A COMPARISONAL STUDY OF THE MULTI-LAYER KOHONEN SELF-ORGANIZING FEATURE MAPS FOR SPOKEN LANGUAGE IDENTIFICATION
Liang Wang, Eliathamby Ambikairajah, The University of New South Wales, Australia; Eric H.C. Choi, National ICT Australia, Australia
S4.6: A NOVEL WEIGHTING TECHNIQUE FOR FUSING LANGUAGE IDENTIFICATION SYSTEMS BASED ON PAIR-WISE PERFORMANCES
Bo Yin, Eliathamby Ambikairajah, The University of New South Wales, Australia; Fang Chen, National ICT Australia Ltd., Australia
S4.7: NON-NATIVE SPEECH DATABASES
Martin Raab, Rainer Gruhn, Harman Becker Automotive Systems, Germany; Elmar Nöth, Universität Erlangen-Nürnberg, Germany
S4.8: DEALING WITH CROSS-LINGUAL ASPECTS IN SPOKEN NAME RECOGNITION
Frederik Stouten, Jean-Pierre Martens, University of Ghent, Belgium
S4.9: CROSSLINGUAL ACOUSTIC MODEL DEVELOPMENT FOR AUTOMATIC SPEECH RECOGNITION
Frank Diehl, Asunción Moreno, Enric Monte, Universitat Politècnica de Catalunya, Spain
S4.10: MULTI-STREAM DIALECT CLASSIFICATION USING SVM-GMM HYBRID CLASSIFIERS
Rahul Chitturi, John H. L. Hansen, University of Texas at Dallas, United States
S4.11: DERIVING SALIENT LEARNERS’ MISPRONUNCIATIONS FROM CROSS-LANGUAGE PHONOLOGICAL COMPARISONS
Helen Mei-Ling Meng, The Chinese University of Hong Kong, Hong Kong SAR of China; Yuen Yee Lo, Lan Wang, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Hong Kong SAR of China; Wing Yiu Lau, The Chinese University of Hong Kong, Hong Kong SAR of China
S4.12: THE TITECH LARGE VOCABULARY WFST SPEECH RECOGNITION SYSTEM
Paul Dixon, Tokyo Institute of Technology, Japan; Diamantino Caseiro, INESC-ID Lisboa, Portugal; Tasuku Oonishi, Sadaoki Furui, Tokyo Institute of Technology, Japan
S4.13: ADVANCES IN ARABIC BROADCAST NEWS TRANSCRIPTION AT RWTH
David Rybach, Stefan Hahn, Christian Gollan, Ralf Schlüter, Hermann Ney, RWTH Aachen University, Germany
S4.14: DEVELOPMENT OF THE 2007 RWTH MANDARIN LVCSR SYSTEM
Björn Hoffmeister, Christian Plahl, Peter Fritz, Georg Heigold, Jonas Lööf, Ralf Schlüter, Hermann Ney, RWTH Aachen University, Germany
S4.15: AN ALGORITHM FOR FAST COMPOSITION OF WEIGHTED FINITE-STATE TRANSDUCERS
John McDonough, Saarland University, Germany; Emilian Stoimenov, University of Karlsruhe, Germany; Dietrich Klakow, Saarland University, Germany
S4.16: A MANDARIN LECTURE SPEECH TRANSCRIPTION SYSTEM FOR SPEECH SUMMARIZATION
Ho Yin Chan, Jian Zhang, Pascale Fung, Lu Cao, Human Language Technology Center, University of Science and Technology, Hong Kong SAR of China
S4.17: THE IBM 2007 SPEECH TRANSCRIPTION SYSTEM FOR EUROPEAN PARLIAMENTARY SPEECHES
Bhuvana Ramabhadran, IBM T. J. Watson Research Center, United States; Olivier Siohan, Google Research, United States; Abhinav Sethy, University of Southern California, United States
S4.18: OOV DETECTION BY JOINT WORD/PHONE LATTICE ALIGNMENT
Hui Lin, Jeff Bilmes, University of Washington, United States; Dimitra Vergyri, SRI International, United States; Katrin Kirchhoff, University of Washington, United States
S4.19: UNCERTAINTY IN TRAINING LARGE VOCABULARY SPEECH RECOGNIZERS
Amarnag Subramanya, Chris Bartels, Jeff Bilmes, University of Washington, United States; Patrick Nguyen, Microsoft Research, United States
S4.20: BUILDING A HIGHLY ACCURATE MANDARIN SPEECH RECOGNIZER
Mei-Yuh Hwang, Gang Peng, University of Washington, United States; Wen Wang, SRI International, United States; Arlo Faria, ICSI Berkely, United States; Aaron Heidel, National Taiwan University, Taiwan; Mari Ostendorf, University of Washington, United States
Panel Discussion
Organizer: Satoshi Nakamura
Panelists: Prem Natarajan, Chiori Hori, Evgeny Matusov, George Saon, Stephan Vogel
Invited Talk: IMPLICIT USER-ADAPTIVE SYSTEM ENGAGEMENT IN SPEECH, PEN AND MULTIMODAL INTERFACES
Sharon Oviatt, Incaa Designs, United States
S5.1: USING PARTICLE FILTERS TO TRACK DIALOGUE STATE
Jason Williams, AT&T Labs - Research, United States
S5.2: A METHOD FOR EVALUATING AND COMPARING USER SIMULATIONS: THE CRAMER-VON MISES DIVERGENCE
Jason Williams, AT&T Labs - Research, United States
S5.3: A MULTI-LAYER ARCHITECTURE FOR SEMI-SYNCHRONOUS EVENT-DRIVEN DIALOGUE MANAGEMENT
Antoine Raux, Maxine Eskenazi, Carnegie Mellon University, United States
S5.4: DEVELOPMENT AND PORTABILITY OF ASR AND Q&A MODULES FOR REAL-ENVIRONMENT SPEECH-ORIENTED GUIDANCE SYSTEMS
Tobias Cincarek, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano, Nara Institute of Science and Technology, Japan
S5.5: ERROR SIMULATION FOR TRAINING STATISTICAL DIALOGUE SYSTEMS
Jost Schatzmann, Blaise Thomson, Steve Young, Cambridge University, United Kingdom
S5.6: A DATA-CENTRIC ARCHITECTURE FOR DATA-DRIVEN SPOKEN DIALOG SYSTEMS
Sebastian Varges, Giuseppe Riccardi, University of Trento, Italy
S5.7: EXAMPLE-BASED ERROR RECOVERY STRATEGY FOR SPOKEN DIALOG SYSTEM
Cheongjae Lee, Sangkeun Jung, Donghyeon Lee, Gary Geunbae Lee, POSTECH, Republic of Korea
S5.8: TYPE-II DIALOGUE SYSTEMS FOR INFORMATION ACCESS FROM UNSTRUCTURED KNOWLEDGE SOURCES
Yi-Cheng Pan, Lin-shan Lee, National Taiwan University, Taiwan
S5.9: UNSUPERVISED STATE CLUSTERING FOR STOCHASTIC DIALOG MANAGEMENT
Fabrice Lefèvre, Renato De Mori, LIA - University of Avignon, France
S5.10: DYNAMIC VOCABULARY PREDICTION FOR ISOLATED-WORD DICTATION ON EMBEDDED DEVICES
Jussi Leppänen, Jilei Tian, Nokia Research Center, Finland
S5.11: DATA SELECTION FOR SPEECH RECOGNITION
Yi Wu, Rong Zhang, Alex Rudnicky, Carnegie Mellon University, United States
S5.12: TOWARDS BOTTOM-UP CONTINUOUS PHONE RECOGNITION
Sabato Marco Siniscalchi, Torbjørn Svendsen, Norwegian University of Science and Technology (NTNU), Norway; Chin-Hui Lee, Georgia Institute of Technology, United States
S5.13: A STUDY ON RESCORING USING HMM-BASED DETECTORS FOR CONTINUOUS SPEECH RECOGNITION
Qiang Fu, Biing-Hwang Juang, Georgia Institute of Technology, United States
S5.14: RANDOM DISCRIMINANT STRUCTURE ANALYSIS FOR AUTOMATIC RECOGNITION OF CONNECTED VOWELS
Yu Qiao, Satoshi Asakawa, Nobuaki Minematsu, The University of Tokyo, Japan
S5.15: PHONOLOGICAL FEATURE BASED VARIABLE FRAME RATE SCHEME FOR IMPROVED SPEECH RECOGNITION
Abhijeet Sangwan, John H. L. Hansen, Center for Robust Speech Systems (CRSS), University of Texas at Dallas, United States
S5.16: AN ENHANCED MINIMUM CLASSIFICATION ERROR LEARNING FRAMEWORK FOR BALANCING INSERTION, DELETION AND SUBSTITUTION ERRORS
Yuan-Fu Liao, National Taipei University of Technology, Taiwan; Jia-Jang Tu, Sen-Chia Chang, Industrial Technology Research Institute, Taiwan; Chin-Hui Lee, Georgia Institute of Technology, United States
S5.17: INTERPOLATIVE VARIABLE FRAME RATE TRANSMISSION OF SPEECH FEATURES FOR DISTRIBUTED SPEECH RECOGNITION
Huiqun Deng, Douglas O'Shaughnessy, University of Quebec, Canada; Jean Dahan, William F. Ganong, Nuance, Canada
S5.18: COMPARING ONE AND TWO-STAGE ACOUSTIC MODELING IN THE RECOGNITION OF EMOTION IN SPEECH
Björn Schuller, Technische Universität München, Germany; Bogdan Vlasenko, Otto-von-Guericke University Magdeburg, Germany; Ricardo Minguez, Gerhard Rigoll, Technische Universität München, Germany; Andreas Wendemuth, Otto-von-Guericke University Magdeburg, Germany
S5.19: EXTENSIBLE SPEECH RECOGNITION SYSTEM USING PROXY-AGENT
Teppei Nakano, Shinya Fujie, Tetsunori Kobayashi, Waseda University, Japan
S5.20: DEVELOPMENT OF VAD EVALUATION FRAMEWORK CENSREC-1-C AND INVESTIGATION OF RELATIONSHIP BETWEEN VAD AND SPEECH RECOGNITION PERFORMANCE
Norihide Kitaoka, Nagoya University, Japan; Kazumasa Yamamoto, Tomohiro Kusamizu, Seiichi Nakagawa, Toyohashi University of Technology, Japan; Takeshi Yamada, University of Tsukuba, Japan; Satoru Tsuge, The University of Tokushima, Japan; Chiyomi Miyajima, Nagoya University, Japan; Takanobu Nishiura, Masato Nakayama, Yuki Denda, Ritsumeikan University, Japan; Masakiyo Fujimoto, NTT Corporation, Japan; Tetsuya Takiguchi, Kobe University, Japan; Satoshi Tamura, Gifu University, Japan; Shingo Kuroiwa, The University of Tokushima, Japan; Kazuya Takeda, Nagoya University, Japan; Satoshi Nakamura, NiCT-ATR, Japan
Panel Discussion
Organizers: Sharon Oviatt, Kazuya Takeda
Panelists: John Hansen, Brandon Roy, Alex Acero
Invited Talk: VOICE/AUDIO INFORMATION RETRIEVAL: MINIMIZING THE NEED FOR HUMAN EARS
Mark Clements, Georgia Institute of Technology, United States; Marsal Gavalda, Nexidia, Inc., United States
S6.1: A SYSTEM FOR SPEECH DRIVEN INFORMATION RETRIEVAL
César González-Ferreras, Valentín Cardeñoso-Payo, Universidad de Valladolid, Spain
S6.2: TOWARDS SPOKEN-DOCUMENT RETRIEVAL FOR THE ENTERPRISE: APPROXIMATE WORD-LATTICE INDEXING WITH TEXT INDEXERS
Frank Seide, Peng Yu, Yu Shi, Microsoft Research Asia, China
S6.3: A STUDY OF LATTICE-BASED SPOKEN TERM DETECTION FOR CHINESE SPONTANEOUS SPEECH
Sha Meng, Tsinghua University, China; Peng Yu, Frank Seide, Microsoft Research Asia, China; Jia Liu, Tsinghua University, China
S6.4: FAST AUDIO SEARCH USING VECTOR-SPACE MODELLING
Brett Matthews, Upendra Chaudhari, Bhuvana Ramabhadran, IBM T. J. Watson Research Center, United States
S6.5: THE LIMSI QAST SYSTEMS: COMPARISON BETWEEN HUMAN AND AUTOMATIC RULES GENERATION FOR QUESTION-ANSWERING ON SPEECH TRANSCRIPTIONS
Sophie Rosset, Olivier Galibert, Gilles Adda, Eric Bilinski, LIMSI, France
S6.6: SOUNDBITE IDENTIFICATION USING REFERENCE AND AUTOMATIC TRANSCRIPTS OF BROADCAST NEWS SPEECH
Feifan Liu, Yang Liu, The University of Texas at Dallas, United States
S6.7: TOPIC IDENTIFICATION FROM AUDIO RECORDINGS USING WORD AND PHONE RECOGNITION LATTICES
Timothy Hazen, Fred Richardson, Anna Margolis, MIT Lincoln Laboratory, United States
S6.8: IMPROVEMENTS IN PHONE BASED AUDIO SEARCH VIA CONSTRAINED MATCH WITH HIGH ORDER CONFUSION ESTIMATES
Upendra Chaudhari, Michael Picheny, IBM T. J. Watson Research Center, United States
S6.9: INTEGRATING SEVERAL ANNOTATION LAYERS FOR STATISTICAL INFORMATION DISTILLATION
Michael Levit, Dilek Hakkani-Tür, International Computer Science Institute, United States; Gökhan Tur, SRI International, United States; Daniel Gillick, International Computer Science Institute, United States
S6.10: ANALYTICAL COMPARISON BETWEEN POSITION SPECIFIC POSTERIOR LATTICES AND CONFUSION NETWORKS BASED ON WORDS AND SUBWORD UNITS FOR SPOKEN DOCUMENT INDEXING
Yi-Cheng Pan, Hung-Lin Chang, Lin-shan Lee, National Taiwan University, Taiwan
S6.11: EFFICIENT USE OF OVERLAP INFORMATION IN SPEAKER DIARIZATION
Scott Otterson, Mari Ostendorf, University of Washington, United States
S6.12: SPEECHFIND FOR CDP: ADVANCES IN SPOKEN DOCUMENT RETRIEVAL FOR THE U. S. COLLABORATIVE DIGITIZATION PROGRAM
Wooil Kim, John H. L. Hansen, University of Texas at Dallas, United States
S6.13: A FAST-MATCH APPROACH FOR ROBUST, FASTER THAN REAL-TIME SPEAKER DIARIZATION
Yan Huang, Oriol Vinyals, Gerald Friedland, Christian Mueller, Nikki Mirghafori, Chuck Wooters, International Computer Science Institute, United States
S6.14: NEVER-ENDING LEARNING SYSTEM FOR ON-LINE SPEAKER DIARIZATION
Konstantin Markov, Satoshi Nakamura, NICT-ATR, Japan
S6.15: MULTIPLE FEATURE COMBINATION TO IMPROVE SPEAKER DIARIZATION OF TELEPHONE CONVERSATIONS
Vishwa Gupta, Patrick Kenny, Pierre Ouellet, Gilles Boulianne, Pierre Dumouchel, Centre de Recherche Informatique de Montreal, Canada
S6.16: SENSEI: SPOKEN ENGLISH ASSESSMENT FOR CALL CENTER AGENTS
Abhishek Chandel, Abhinav Parate, Maymon Madathingal, Himanshu Pant, Nitendra Rajput, Shajith Ikbal, Om Deshmukh, Ashish Verma, IBM India Research Lab, India
S6.17: TOWARDS ROBUST AUTOMATIC EVALUATION OF PATHOLOGIC TELEPHONE SPEECH
Korbinian Riedhammer, Universität Erlangen-Nürnberg, Germany; Georg Stemmer, Siemens AG, Germany; Tino Haderlein, Maria Schuster, Frank Rosanowski, Elmar Nöth, Andreas Maier, Universität Erlangen-Nürnberg, Germany
Panel Discussion
Organizer: Timothy Hazen
Panelists: Lin-shan Lee, Frank Seide, Steve Renals, Mark Clements