Paper: Positioning Unknown Words In A Thesaurus By Using Information Extracted From A Corpus

ACL ID C96-2161
Title Positioning Unknown Words In A Thesaurus By Using Information Extracted From A Corpus
Venue International Conference on Computational Linguistics
Session Main Conference
Year 1996
Authors

This p~q)er describes a. method for positio,ing un- known words in an existing thesa,rus by using word- to-word rela.tionships with relation (case) markers extracted from a large corpus. A suitable area (if the thesaurus for an unknown woM ix estimated l)y inte- grating the human intuition I)urled in the thesaurus and statistical data extracted from the corpus. To overcome the prohlem of data sparseness, distin- guishing features of each node, called "viewpoints" are. extracted a.utomatically and used to calcMa.te the similarity between the unknown woM and a. word in the thesaurus. The results of a.tl experi- ment confirm the COrltril)ution of viewl)oints to the I)ositioning task.