Paper: Automatic Tagging Of Arabic Text: From Raw Text To Base Phrase Chunks

ACL ID N04-4038
Title Automatic Tagging Of Arabic Text: From Raw Text To Base Phrase Chunks
Venue Human Language Technologies
Session Short Paper
Year 2004
Authors

To date, there are no fully automated systems addressing the community’s need for funda- mental language processing tools for Arabic text. In this paper, we present a Support Vector Machine (SVM) based approach to automati- cally tokenize (segmenting off clitics), part-of- speech (POS) tag and annotate base phrases (BPs) in Arabic text. We adapt highly accu- rate tools that have been developed for En- glish text and apply them to Arabic text.