Paper: Towards Effective Sentence Simplification for Automatic Processing of Biomedical Text

ACL ID N09-2045
Title Towards Effective Sentence Simplification for Automatic Processing of Biomedical Text
Venue Human Language Technologies
Session Short Paper
Year 2009
Authors

The complexity of sentences characteristic to biomedical articles poses a challenge to natu- ral language parsers, which are typically trained on large-scale corpora of non-technical text. We propose a text simplification process, bioSimplify, that seeks to reduce the complex- ity of sentences in biomedical abstracts in or- der to improve the performance of syntactic parsers on the processed sentences. Syntactic parsing is typically one of the first steps in a text mining pipeline. Thus, any improvement in performance would have a ripple effect over all processing steps. We evaluated our method using a corpus of biomedical sen- tences annotated with syntactic links. Our em- pirical results show an improvement of 2.90% for the Charniak-McClosky parser and of 4.23% for the Link ...