Paper: The GENIA Project: Corpus-Based Knowledge Acquisition And Information Extraction From Genome Research Papers

ACL ID E99-1043
Title The GENIA Project: Corpus-Based Knowledge Acquisition And Information Extraction From Genome Research Papers
Venue Annual Meeting of The European Chapter of The Association of Computational Linguistics
Session Main Conference
Year 1999
Authors

We present an outline of the genome in- formation acquisition (GENIA) project for automatically extracting biochemical information from journal papers and ab- stracts. GENIA will be available over the Internet and is designed to aid in information extraction, retrieval and vi- sualisation and to help reduce informa- tion overload on researchers. The vast repository of papers available online in databases such as MEDLINE is a natu- ral environment in which to develop lan- guage engineering methods and tools and is an opportunity to show how language engineering can play a key role on the Internet.