Paper: Segmenting Sentences Into Linky Strings Using D-Bigram Statistics

ACL ID C96-2099
Title Segmenting Sentences Into Linky Strings Using D-Bigram Statistics
Venue International Conference on Computational Linguistics
Session Main Conference
Year 1996
Authors

It is obvious that segmentation takes an important role in natural language processing(NLP), especially for the lan- guages whose sentences are not eas- ily separated into morphemes. In this study we propose a method of segment- ing a sentence. The system described in this paper does not use any gram- matical information or knowledge in processing. Instead, it uses statistical information drawn from non-tagged cor- pus of the target language. Most of the segmenting systems are to pick out conventional morphemes which is de- fined for human use. However, we still do not know whether those conventional morphemes are good units for compu- tational processing. In this paper we explain our system's algorithm and its experimental results on Japanese, though this system is not designed for a part...