Paper: A System For Creating And Manipulating Generalized Wordclass Transition Matrices From Large Labelled Text-Corpora

ACL ID C88-1011
Title A System For Creating And Manipulating Generalized Wordclass Transition Matrices From Large Labelled Text-Corpora
Venue International Conference on Computational Linguistics
Session Main Conference
Year 1988
Authors

This paper deals with the training phase of a Markov-type linguistic model that is based on transition probabilities between pvirs and triplets of syntactic categories. To deter- mine the o?timal level of detail for a set of syntactic classes we developed a systetn that uses a set-theoretical formalism to defiue such sets mid has some measm~s to comp~uce and c,ptimize them fildividually. In section two we describe the optimizafiou problem (hi terms of piediction, infoimation and economy requilements) and our approach to its solution. Section three introduces the system dlat will assist a lhlguist in h,'mdling the prediction and economy criteria and in the last section we plesent some slunple lemtlts that can be achieved with it. I. IN'fRODUCrlON The context in which we strutted devclopping...