Paper: An IBM-PC Environment For Chinese Corpus Analysis

ACL ID C94-1096
Title An IBM-PC Environment For Chinese Corpus Analysis
Venue International Conference on Computational Linguistics
Session Main Conference
Year 1994
Authors

This paper describes a set of computer programs for Chinese corpus analysis. These programs include (1) extraction of different characters, bigrams and words; (2) word segmentation based on bigram, maximal-matching and the combined technique; (3) identification of special terms; (4) Chinese concordancing; (5) compiling collocation statistics and (6) evaluation utilities. These programs run on the IBM- PC and batch programs co-ordinate the use of these programs.