Paper: Mining Co-Occurrence Matrices for SO-PMI Paradigm Word Candidates

ACL ID E12-3009
Title Mining Co-Occurrence Matrices for SO-PMI Paradigm Word Candidates
Venue Annual Meeting of The European Chapter of The Association of Computational Linguistics
Session Student Session
Year 2012
Authors

This paper is focused on one aspect of SO- PMI, an unsupervised approach to senti- ment vocabulary acquisition proposed by Turney (Turney and Littman, 2003). The method, originally applied and evaluated for English, is often used in bootstrap- ping sentiment lexicons for European lan- guages where no such resources typically exist. In general, SO-PMI values are com- puted from word co-occurrence frequencies in the neighbourhoods of two small sets of paradigm words. The goal of this work is to investigate how lexeme selection affects the quality of obtained sentiment estima- tions. This has been achieved by compar- ing ad hoc random lexeme selection with two alternative heuristics, based on clus- tering and SVD decomposition of a word co-occurrence matrix, demonstrating supe- riority of the...