Paper: Blending Segmentation With Tagging In Chinese Language Corpus Processing

ACL ID C94-2209
Title Blending Segmentation With Tagging In Chinese Language Corpus Processing
Venue International Conference on Computational Linguistics
Session Main Conference
Year 1994
Authors

This paper proposes a new method for Chinese language corpus processing. Unlike previous research, our approach has the following characteristics: it blends segmentation with tagging, and it integrates a rule-based approach with a statistics-based one in grammatical disambiguation. The principal ideas presented in the paper are incorporated in the development of a Chinese corpus processing system. Experimental results show that the overall accuracy is 97.68% for segmentation and 94.55% for tagging on a corpus of about 400,000 Chinese characters.
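
The abstract does not spell out the algorithm, so the following is only an illustrative sketch of what blending segmentation with tagging can look like: a single dynamic-programming pass over a word lattice that chooses word boundaries and part-of-speech tags together, so that tag context can resolve a segmentation ambiguity. It is not the paper's method. The lexicon, the tag set ("n", "v"), all probabilities, and the function name `segment_and_tag` are hypothetical, and the rule-based component mentioned in the abstract is not modelled here.

```python
import math

# Hypothetical toy lexicon: word -> {tag: P(word | tag)}. All values are invented.
LEXICON = {
    "研究": {"v": 0.02, "n": 0.01},
    "研究生": {"n": 0.005},
    "生命": {"n": 0.01},
    "命": {"n": 0.001},
    "起源": {"n": 0.008, "v": 0.002},
}

# Hypothetical tag-bigram probabilities P(tag | previous tag); "<s>" marks the start.
TRANSITIONS = {
    ("<s>", "v"): 0.4, ("<s>", "n"): 0.6,
    ("v", "n"): 0.7, ("v", "v"): 0.3,
    ("n", "n"): 0.5, ("n", "v"): 0.5,
}

MAX_WORD_LEN = max(len(word) for word in LEXICON)


def segment_and_tag(text):
    """Return the best-scoring list of (word, tag) pairs, or None if no parse exists."""
    # best[i][tag] = (log score, start index of the last word, tag of the previous word)
    best = [dict() for _ in range(len(text) + 1)]
    best[0]["<s>"] = (0.0, None, None)

    for end in range(1, len(text) + 1):
        for start in range(max(0, end - MAX_WORD_LEN), end):
            word = text[start:end]
            # Every lexicon word ending at `end` is a candidate, with every tag it allows.
            for tag, emit in LEXICON.get(word, {}).items():
                for prev_tag, (prev_score, _, _) in best[start].items():
                    trans = TRANSITIONS.get((prev_tag, tag))
                    if trans is None:
                        continue
                    score = prev_score + math.log(trans) + math.log(emit)
                    if tag not in best[end] or score > best[end][tag][0]:
                        best[end][tag] = (score, start, prev_tag)

    if not best[-1]:
        return None  # the toy lexicon cannot cover the whole input

    # Follow back-pointers from the best final tag to recover words and tags together.
    tag = max(best[-1], key=lambda t: best[-1][t][0])
    end = len(text)
    result = []
    while end > 0:
        _, start, prev_tag = best[end][tag]
        result.append((text[start:end], tag))
        end, tag = start, prev_tag
    return result[::-1]


if __name__ == "__main__":
    print(segment_and_tag("研究生命起源"))
```

With these invented numbers, the ambiguous string "研究生命起源" is split as 研究/v 生命/n 起源/n rather than 研究生/n 命/n 起源/n, because the segmentation and the tag sequence are scored in one pass instead of the segmentation being fixed before tagging.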