Paper: Unsupervised Approaches for Automatic Keyword Extraction Using Meeting Transcripts

ACL ID N09-1070
Title Unsupervised Approaches for Automatic Keyword Extraction Using Meeting Transcripts
Venue Human Language Technologies
Session Main Conference
Year 2009
Authors

This paper explores several unsupervised ap- proaches to automatic keyword extraction using meeting transcripts. In the TFIDF (term frequency, inverse document frequency) weighting framework, we incorporated part- of-speech(POS)information,wordclustering, and sentence salience score. We also evalu- ated a graph-based approach that measures the importance of a word based on its connection with other sentences or words. The system performanceisevaluatedin differentways, in- cluding comparison to human annotated key- words using F-measure and a weighted score relative to the oracle system performance, as well as a novel alternative human evaluation. Our results have shown that the simple un- supervised TFIDF approach performs reason- ably well, and the additional information from POS and sent...