Paper: Support Vector Machine Approach To Extracting Gene References Into Function From Biological Documents

ACL ID W04-1209
Title Support Vector Machine Approach To Extracting Gene References Into Function From Biological Documents
Venue International Joint Workshop On Natural Language Processing In Biomedicine And Its Applications NLPBA BioNLP
Session
Year 2004
Authors

In the biological domain, extracting newly discovered functional features from the massive literature is a major challenging issue. To automatically annotate Gene References into Function (GeneRIF) in a new literature is the main goal of this paper. We tried to find GRIF words in a training corpus, and then applied these informative words to annotate the GeneRIFs in abstracts with several different weighting schemes. The experiments showed that the Classic Dice score is at most 50.18%, when the weighting schemes proposed in the paper (Hou et al. , 2003) were adopted. In contrast, after employing Support Vector Machines (SVMs) and the definition of classes proposed by Jelier et al. (2003), the score greatly improved to 56.86% for Classic Dice (CD). Adopting the same features, SVMs demonstra...