Paper: Computing Confidence Scores for All Sub Parse Trees

ACL ID P08-2055
Title Computing Confidence Scores for All Sub Parse Trees
Venue Annual Meeting of the Association of Computational Linguistics
Session Main Conference
Year 2008

Computing confidence scores for applica- tions, such as dialogue system, informa- tion retrieving and extraction, is an active research area. However, its focus has been primarily on computing word-, concept-, or utterance-level confidences. Motivated by the need from sophisticated dialogue systems for more effective dialogs, we generalize the confidence annotation to all the subtrees, the first effort in this line of research. The other contribution of this work is that we incorporated novel long distance features to address challenges in computing multi-level confidence scores. Using Conditional Maximum Entropy (CME) classifier with all the selected fea- tures, we reached an annotation error rate of 26.0% in the SWBD corpus, compared with a subtree error rate of 41.91%, a...