ACL Anthology Network (All About NLP) (beta) The Association Of Computational Linguistics Anthology Network |
ACL ID | D09-1080 |
---|---|
Title | Integrating sentence- and word-level error identification for disfluency correction |
Venue | Conference on Empirical Methods in Natural Language Processing |
Session | Main Conference |
Year | 2009 |
Authors |
|
While speaking spontaneously, speakers often make errors such as self-correction or false starts which interfere with the successful application of natural language processing techniques like summarization and machine translation to this data. There is active work on reconstructing this error- ful data into a clean and fluent transcript by identifying and removing these simple errors. Previous research has approximated the potential benefit of conducting word-level reconstruction of simple errors only on those sentences known to have errors. In this work, we explore new approaches for automatically identifying speaker con- struction errors on the utterance level, and quantifytheimpactthatthisinitialstephas on word- and sentence-level reconstruc- tion accuracy.