Paper: A Probabilistic Approach To Compound Noun Indexing In Korean Texts

ACL ID C96-1087
Title A Probabilistic Approach To Compound Noun Indexing In Korean Texts
Venue International Conference on Computational Linguistics
Session Main Conference
Year 1996
Authors

In this paper we address the prob- lem of compound noun indexing that is about segmenting or decomposing compound nouns into promising index terms. Compound nouns as index terms that usually subscribe to specific no- tions tend to increase the precision of retrieval performance. The use of the component nouns of a compound noun as index terms, on the other hand, may improve the recall performance, but can decrease the precision. Our proposed method to handle com- pound nouns with a goal to increase the recall while preserving the preci- sion computes the relevance of the com- ponent nouns of a compound noun to the document content by comparing the document sets that are supported by the component nouns and the terms of the document. The operational content of a term is represented as the p...