Paper: Identifying Phrasal Verbs Using Many Bilingual Corpora

ACL ID D13-1060
Title Identifying Phrasal Verbs Using Many Bilingual Corpora
Venue Conference on Empirical Methods in Natural Language Processing
Session Main Conference
Year 2013
Authors

We address the problem of identifying multiword expressions in a language, focusing on English phrasal verbs. Our polyglot ranking approach integrates frequency statistics from translated corpora in 50 different languages. Our experimental evaluation demonstrates that combining statistical evidence from many parallel corpora using a novel ranking-oriented boosting algorithm produces a comprehensive set of English phrasal verbs, achieving performance comparable to a human-curated set.