Paper: Annotating 200 Million Words: The Bank Of English Project

ACL ID C94-1092
Title Annotating 200 Million Words: The Bank Of English Project
Venue International Conference on Computational Linguistics
Session Main Conference
Year 1994
Authors

The llank of English is an international English hm- guage project sponsored by llarper-Collins Publish- ers, Glasgow, and conducl;ed by the COBUILD team at the University of Birrnhlgham, UK. The text hank will comprise some 200 million words of both written and spoken English. The whole 200 million word col pns is being annotated morphologically and syntacti- cally during 1993-94 at the Research Unit for Cor,,- Imtational Linguistics (IL/I(3L), University of Ilel- sinkl, using the Fmglish nmrphological analyser (ENC,- TW()I,) and English Constraint (:h'ammar (EN(:I(:.'(:~) parser. The first half of the texts (103 million words) has ah'eady been processed in 1993. The project is lead by Prof. 3ohn Sinchdr in Birmingham, and l'rof. Fred Karlsson in Ilelsinld. The present author is re- spons...