Paper: Deriving de/het gender classification for Dutch nouns for rule-based MT generation tasks

ACL ID W14-1014
Title Deriving de/het gender classification for Dutch nouns for rule-based MT generation tasks
Venue Workshop on Hybrid Approaches to Translation
Session
Year 2014
Authors

Abstract Linguistic resources available in the pub-lic domain, such as lemmatisers, part-of-speech taggers and parsers can be used for the development of MT systems: as separate processing modules or as anno-tation tools for the training corpus. For SMT this annotation is used for training factored models, and for the rule-based systems linguistically annotated corpus is the basis for creating analysis, generation and transfer dictionaries from corpora. However, the annotation in many cases is insufficient for rule-based MT, especially for the generation tasks. In this paper we analyze a specific case when the part-of-speech tagger does not provide infor-mation about de/het gender of Dutch nouns that is needed for our rule-based MT systems translating into Dutch. We show that this informa...