Paper: How To Get A Chinese Name (Entity): Segmentation And Combination Issues

ACL ID: W03-1026
Title: How To Get A Chinese Name (Entity): Segmentation And Combination Issues
Venue: Conference on Empirical Methods in Natural Language Processing
Session: Main Conference
Year: 2003
Authors:

When building a Chinese named entity recognition system, one must deal with certain language-specific issues, such as whether the model should be based on characters or words. While there is no unique answer to this question, we discuss in detail the advantages and disadvantages of each model, identify problems in segmentation, and suggest possible solutions, presenting our observations, analysis, and experimental results. The second topic of this paper is classifier combination. We present four classifiers for Chinese named entity recognition and describe various methods for combining their outputs. The results demonstrate that classifier combination is an effective technique for improving system performance: experiments over a large annotated corpus of fine-grained entity typ...