Paper: A Probabilistic Approach To Grammatical Analysis Of Written English By Computer

ACL ID E85-1023
Title A Probabilistic Approach To Grammatical Analysis Of Written English By Computer
Venue Annual Meeting of The European Chapter of The Association of Computational Linguistics
Session Main Conference
Year 1985
Authors

Work at the Unit for Computer Research on the English Language at the University of Lancaster has been directed towards producing a grammatically annotated version of the Lancaster-Oslo/Bergen (LOB) Corpus of written British English texts as the preliminary stage in developing computer programs and data files for providing a grammatical analysis of unrestricted English text. From 1981-83, a suite of PASCAL programs was devised to automatically produce a single level of grammatical description with one word tag representing the word class or part of speech of each word token in the corpus. Error analysis and subsequent modification to the system resulted in over 96 per cent of word tags being correctly assigned automatically. The remaining 3 to 4 per cent were corrected by human post-edit...