Paper: Diacritic Annotation in the Arabic Treebank and its Impact on Parser Evaluation

ACL ID L08-1361
Title Diacritic Annotation in the Arabic Treebank and its Impact on Parser Evaluation
Venue International Conference on Language Resources and Evaluation
Session Main Conference
Year 2008
Authors

ons compatible with the ElixirFM system. The contents of the arising valency lexicon will not be limited to evidence from PADT. The Arabic Gigaword (Graff, 2007)suppliesevenmorerawdataofthenewswiredomain, while the CLARA4 corpus offers documents from literature and other types of texts. The printed dictionaries to be consulted include (e.g. Wehr, 1979; Baalb Graff, 2007 David Graff. Arabic Gigaword Third Edition. LDC2007T40, 158563-460-3, 2007. Jan Hajiˇc Zdeˇnka Ureˇsov´a Linguistic Annotation: from Links to Cross-Layer Lexicons 2003 In Proceedings of The Second Workshop on Treebanks and Linguist...