Session: Dialogue Systems (Voice Agents, Applications, and Field Trials)

Title: Improvements on a Semi-Automatic Grammar Induction Framework

Authors: Chin-chung Wong, Helen Meng

Abstract: This work extends the semi-automatic grammar induction approach previously proposed in [1]. The data-driven approach learns semantic and phrasal categories from a training corpus of unannotated natural language queries in a specific domain. Grammar rules are acquired automatically by an agglomerative clustering procedure, and the resulting grammar can easily be hand-edited for refinement. This work attempts to improve the grammar induction framework by leveraging information in the SQL query that accompanies every training query. The SQL expression specifies the database access action that corresponds to the query, and hence provides information about the meaningful natural language structures that should be captured in the induced grammar. We have also incorporated Information Gain in place of Mutual Information to capture phrasal structures, and we determine an automatic stopping criterion for agglomerative clustering.
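
To make the contrast between the two phrasal measures concrete, the following is a minimal illustrative sketch and not the authors' implementation. It assumes textbook definitions: pointwise mutual information for an adjacent word pair, and information gain as the reduction in entropy of the following word once it is known whether the preceding word is a given word. The toy queries and all function names are hypothetical.

```python
import math
from collections import Counter


def bigram_counts(queries):
    """Count adjacent word pairs over whitespace-tokenized queries."""
    pairs = Counter()
    for query in queries:
        tokens = query.split()
        pairs.update(zip(tokens, tokens[1:]))
    return pairs


def entropy(counts):
    """Shannon entropy (bits) of the distribution implied by a Counter."""
    total = sum(counts.values())
    return -sum(c / total * math.log2(c / total) for c in counts.values() if c)


def pointwise_mi(pairs, w1, w2):
    """log2 P(w1, w2) / (P(w1) P(w2)) over adjacent pairs; -inf if unseen."""
    total = sum(pairs.values())
    p_xy = pairs[(w1, w2)] / total
    p_x = sum(c for (a, _), c in pairs.items() if a == w1) / total
    p_y = sum(c for (_, b), c in pairs.items() if b == w2) / total
    return math.log2(p_xy / (p_x * p_y)) if p_xy > 0 else float("-inf")


def information_gain(pairs, w1):
    """H(next word) - H(next word | whether the previous word is w1)."""
    total = sum(pairs.values())
    all_next, after, other = Counter(), Counter(), Counter()
    for (a, b), c in pairs.items():
        all_next[b] += c
        (after if a == w1 else other)[b] += c
    conditional = 0.0
    for part in (after, other):
        n = sum(part.values())
        if n:
            conditional += n / total * entropy(part)
    return entropy(all_next) - conditional


# Hypothetical toy corpus, for illustration only.
queries = [
    "show flights from boston to denver",
    "show flights from denver to boston",
    "list fares from boston to dallas",
]
pairs = bigram_counts(queries)
print(pointwise_mi(pairs, "show", "flights"))
print(information_gain(pairs, "from"))
```

Under these assumed definitions, pointwise mutual information scores one specific word pair, whereas information gain averages over every word that may follow, so a single rare but strongly associated pair influences it less.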