Paper: An Unsupervised Morpheme-Based HMM For Hebrew Morphological Disambiguation

ACL ID P06-1084
Title An Unsupervised Morpheme-Based HMM For Hebrew Morphological Disambiguation
Venue Annual Meeting of the Association of Computational Linguistics
Session Main Conference
Year 2006
Authors

Morphological disambiguation is the pro- cess of assigning one set of morphologi- cal features to each individual word in a text. When the word is ambiguous (there are several possible analyses for the word), a disambiguation procedure based on the word context must be applied. This paper deals with morphological disambiguation of the Hebrew language, which combines morphemes into a word in both agglutina- tive and fusional ways. We present an un- supervised stochastic model – the only re- source we use is a morphological analyzer – which deals with the data sparseness prob- lem caused by the affixational morphology of the Hebrew language. We present a text encoding method for languages with affixational morphology in which the knowledge of word formation rules (which are quite restric...