Paper: Japanese Named Entity Extraction With Redundant Morphological Analysis

ACL ID N03-1002
Title Japanese Named Entity Extraction With Redundant Morphological Analysis
Venue Human Language Technologies
Session Main Conference
Year 2003
Authors

Named Entity (NE) extraction is an important subtask of document processing such as in- formation extraction and question answering. A typical method used for NE extraction of Japanese texts is a cascade of morphological analysis, POS tagging and chunking. However, there are some cases where segmentation gran- ularity contradicts the results of morphologi- cal analysis and the building units of NEs, so that extraction of some NEs are inherently im- possible in this setting. To cope with the unit problem, we propose a character-based chunk- ing method. Firstly, the input sentence is an- alyzed redundantly by a statistical morpholog- ical analyzer to produce multiple (n-best) an- swers. Then, each character is annotated with its character types and its possible POS tags of the top n-best ans...