Paper: Detecting Multiword Verbs In The English Sublanguage Of MEDLINE Abstracts

ACL ID C04-1124
Title Detecting Multiword Verbs In The English Sublanguage Of MEDLINE Abstracts
Venue International Conference on Computational Linguistics
Session Main Conference
Year 2004
Authors

Chun Xiao and Dietmar R¨osner Institut f¨ur Wissens- und Sprachverarbeitung Otto-von-Guericke Universit¨at Magdeburg Universit¨atsplatz 2, Magdeburg, Germany 39106 xiao|roesner@iws.cs.uni-magdeburg.de Abstract In this paper, we investigate the multiword verbs in the English sublanguage of MED- LINE abstracts. Based on the integration of the domain-specific named entity knowledge and syntactic as well as statistical information, this work mainly focuses on how to evaluate a proper multiword verb candidate. Our results present a sound balance between the low- and high-frequency multiword verb candidates in the sublanguage corpus. We get a F-measure of 0.753, when tested on a manual sample subset consisting of multiword candidates with both low- and high-frequencies.