Paper: Recognizing Unregistered Names For Mandarin Word Identification

ACL ID C92-4199
Title Recognizing Unregistered Names For Mandarin Word Identification
Venue International Conference on Computational Linguistics
Session Main Conference
Year 1992
Authors

Word Identification has been an important and ac- tive issue in Chinese Natural Language Processing. In this paper, a new mechanism, based on the concept of sublanguage, is proposed for identifying unknown words, especially personal names, in Chinese newspa- pers. The proposed mechanism includes title.driven name recognition, adaptive dynamic word formation, identification of Z-character and 3-character Chinese names without title. We will show the e~:perimental results for two corpora and compare them with the re- sults by the NTIIU's statistic-based system, the only system that we know has attacked the same problem. The ezperimental results have shown significant im- provements over the WI systems without the name identification capability.