Paper: Distributed Modules For Text Annotation And IE Applied To The Biomedical Domain

ACL ID W04-1208
Title Distributed Modules For Text Annotation And IE Applied To The Biomedical Domain
Venue International Joint Workshop On Natural Language Processing In Biomedicine And Its Applications NLPBA BioNLP
Session
Year 2004
Authors

Biological databases contain facts from the scientific literature that have been curated by hand to ensure high quality. Curation is time-consuming and can be supported by information extraction methods. We present a server which identifies biological facts in scientific text and presents the annotation to the curator. The annotated facts are UniProt, UMLS and GO terminology, gene and protein names, mutations and protein-protein interactions. UniProt, UMLS and GO concepts are automatically linked to the original source. The module for mutations is based on syntax patterns and the one for protein-protein interactions on NLP. All modules work independently of each other in single threads and are combined in a pipeline to ensure proper meta data integration. For fast re...
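
The architecture of independent annotation modules merged by a pipeline can be illustrated with a minimal sketch. It is not the server's implementation. All names (Annotation, annotate_mutations, make_dictionary_annotator, run_pipeline) are hypothetical, the mutation pattern (wild-type residue, position, mutant residue, as in "V600E") is an assumed stand-in for the syntax patterns, and a simple term list stands in for the UniProt, UMLS and GO terminology lookup.

```python
import re
from dataclasses import dataclass
from typing import Callable, Iterable, List


@dataclass(frozen=True)
class Annotation:
    """A stand-off annotation: character offsets plus a type label."""
    start: int
    end: int
    kind: str
    text: str


# Assumed pattern for point mutations in one-letter amino acid code,
# e.g. "V600E". The patterns used by the actual module are not given here.
_AA = "ACDEFGHIKLMNPQRSTVWY"
MUTATION_RE = re.compile(rf"\b([{_AA}])(\d+)([{_AA}])\b")


def annotate_mutations(text: str) -> List[Annotation]:
    """Pattern-based module: tag every match of the mutation pattern."""
    return [
        Annotation(m.start(), m.end(), "mutation", m.group(0))
        for m in MUTATION_RE.finditer(text)
    ]


def make_dictionary_annotator(
    kind: str, terms: Iterable[str]
) -> Callable[[str], List[Annotation]]:
    """Build a terminology module from a term list (a stand-in for a
    UniProt, UMLS or GO lookup)."""
    ordered = sorted(set(terms), key=len, reverse=True)  # prefer longest match
    if not ordered:
        return lambda text: []
    pattern = re.compile(
        "|".join(r"\b" + re.escape(t) + r"\b" for t in ordered),
        re.IGNORECASE,
    )

    def annotate(text: str) -> List[Annotation]:
        return [
            Annotation(m.start(), m.end(), kind, m.group(0))
            for m in pattern.finditer(text)
        ]

    return annotate


def run_pipeline(
    text: str, modules: Iterable[Callable[[str], List[Annotation]]]
) -> List[Annotation]:
    """Run each module on the same unmodified text and merge the results.

    No module sees another module's output, so modules can be added,
    removed or reordered without affecting each other.
    """
    merged: List[Annotation] = []
    for module in modules:
        merged.extend(module(text))
    return sorted(merged, key=lambda a: (a.start, a.end))
```

Because every module returns offsets into the same unmodified text, the merged annotations can be integrated and displayed together without one module's markup interfering with another module's matching.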