Paper: Detection Of Entity Mentions Occurring In English And Chinese Text

ACL ID H05-1048
Title Detection Of Entity Mentions Occurring In English And Chinese Text
Venue Conference on Empirical Methods in Natural Language Processing
Session Main Conference
Year 2005
Authors

In this paper, we describe an integrated approach to entity mention detection that yields a monolithic, almost language-independent system. It is optimal in the sense that all categorical constraints are simultaneously considered. The system is compact and easy to develop and maintain, since only a single set of features and classifiers needs to be designed and optimized. It is implemented using one-versus-all support vector machine (SVM) classifiers and a number of feature extractors at several linguistic levels. SVMs are well known for their ability to handle a large set of overlapping features with theoretically sound generalization properties. Data sparsity might be an important issue as a result of a large number of classes and relatively moderate training data siz...
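
The one-versus-all setup named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the feature extractor, the label set (PER, GPE, O), the toy sentences, and the hyperparameters are all assumptions made for illustration, and the binary trainer is a plain subgradient method on the L2-regularized hinge loss (a linear SVM objective) rather than whatever solver the paper used. One binary classifier is trained per label, and each token receives the label whose classifier scores highest.

```python
# Illustrative sketch only: all names, features, labels, and hyperparameters
# here are hypothetical and are not taken from the paper.

def extract_features(tokens, i):
    """Toy feature extractor: word identity, neighboring words, capitalization."""
    word = tokens[i]
    prev_word = tokens[i - 1].lower() if i > 0 else "<s>"
    next_word = tokens[i + 1].lower() if i + 1 < len(tokens) else "</s>"
    feats = ["bias", "w=" + word.lower(), "prev=" + prev_word, "next=" + next_word]
    if word[:1].isupper():
        feats.append("cap")
    return feats


def score(weights, feats):
    """Linear score: sum of the weights of the active binary features."""
    return sum(weights.get(f, 0.0) for f in feats)


def train_binary_svm(examples, epochs=200, lr=0.1, lam=0.001):
    """Subgradient descent on the L2-regularized hinge loss; y is +1 or -1."""
    weights = {}
    for _ in range(epochs):
        for feats, y in examples:
            violated = y * score(weights, feats) < 1.0
            # Regularization step: shrink every weight slightly.
            for f in weights:
                weights[f] *= 1.0 - lr * lam
            # Hinge-loss step: move toward the example if its margin is below 1.
            if violated:
                for f in feats:
                    weights[f] = weights.get(f, 0.0) + lr * y
    return weights


def train_one_vs_all(tagged_sentences):
    """Train one binary classifier per label (that label versus all others)."""
    data = [(extract_features(tokens, i), tag)
            for tokens, tags in tagged_sentences
            for i, tag in enumerate(tags)]
    labels = sorted({tag for _, tag in data})
    return {label: train_binary_svm([(f, 1 if t == label else -1) for f, t in data])
            for label in labels}


def predict_tags(models, tokens):
    """Assign each token the label whose classifier gives the highest score."""
    tags = []
    for i in range(len(tokens)):
        feats = extract_features(tokens, i)
        tags.append(max(sorted(models), key=lambda label: score(models[label], feats)))
    return tags


# Hypothetical toy training data: (tokens, per-token labels).
TRAIN = [
    ("John visited Paris".split(), ["PER", "O", "GPE"]),
    ("Mary left London".split(), ["PER", "O", "GPE"]),
    ("Paris welcomed Mary".split(), ["GPE", "O", "PER"]),
    ("London impressed John".split(), ["GPE", "O", "PER"]),
]

models = train_one_vs_all(TRAIN)
print(predict_tags(models, "Mary left London".split()))
```

Because every label shares the same feature extractor, adding a class in this sketch only adds one more binary classifier, which loosely mirrors the abstract's point about a single set of features and classifiers.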