ACL Anthology Network (All About NLP) (beta) The Association Of Computational Linguistics Anthology Network |
ACL ID | P06-1084 |
---|---|
Title | An Unsupervised Morpheme-Based HMM For Hebrew Morphological Disambiguation |
Venue | Annual Meeting of the Association of Computational Linguistics |
Session | Main Conference |
Year | 2006 |
Authors |
|
Morphological disambiguation is the pro- cess of assigning one set of morphologi- cal features to each individual word in a text. When the word is ambiguous (there are several possible analyses for the word), a disambiguation procedure based on the word context must be applied. This paper deals with morphological disambiguation of the Hebrew language, which combines morphemes into a word in both agglutina- tive and fusional ways. We present an un- supervised stochastic model – the only re- source we use is a morphological analyzer – which deals with the data sparseness prob- lem caused by the affixational morphology of the Hebrew language. We present a text encoding method for languages with affixational morphology in which the knowledge of word formation rules (which are quite restric...