Paper: Automatic Processing Of Large Corpora For The Resolution Of Anaphora References

ACL ID C90-3063
Title Automatic Processing Of Large Corpora For The Resolution Of Anaphora References
Venue International Conference on Computational Linguistics
Session Main Conference
Year 1990
Authors

Manual acquisition of semantic constraints in broad domains is very expensive. This paper presents an automatic scheme for collecting statistics on cooc- currence patterns in a large corpus. To a large ex- tent, these statistics reflect, semantic constraints and thus are used to disambiguate anaphora references and syntactic ambiguities. The scherne was imple- mented by gathering statistics on the output of other linguistic tools. An experiment was performed to resolve references of the pronoun "it" in sentences that were randomly selected from the corpus. Ttle results of the experiment show that in most of the cases the cooccurrence statistics indeed reflect the semantic constraints and thus provide a basis {'or a useful disambiguat.ion tool.