Paper: Decompounding query keywords from compounding languages

ACL ID P08-2064
Title Decompounding query keywords from compounding languages
Venue Annual Meeting of the Association of Computational Linguistics
Session Main Conference
Year 2008
Authors

Splitting compound words has proved to be useful in areas such as Machine Translation, Speech Recognition or Information Retrieval (IR). Furthermore, real-time IR systems (such as search engines) need to cope with noisy data, as user queries are sometimes written quickly and submitted without review. In this paperweapplyastate-of-the-artprocedurefor German decompounding to other compound- ing languages, and we show that it is possible to have a single decompounding model that is applicable across languages.