Paper: Development Implementation And Testing Of A Discourse Model For Newspaper Texts

ACL ID H93-1031
Title Development Implementation And Testing Of A Discourse Model For Newspaper Texts
Venue Human Language Technologies
Session Main Conference
Year 1993
Authors

Texts of a particular type evidence a discernible, predictable schema. These schemata can be delineated, and as such provide models of their respective text types that are useful for automatically structuring texts. We have developed a Text Structurer module that recognizes text-level structure; within a larger information retrieval system, it delineates the discourse-level organization of each document's contents. The system can then give higher weight to those document components that are more likely to contain the type of information suggested by the user's query. We chose newspaper text as the first text type to implement. Several iterations of manually coding a randomly chosen sample of newspaper articles enabled us to develop a newspaper text model. This process suggeste...
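
The idea of weighting document components by their discourse role can be illustrated with a minimal sketch. It is not the authors' implementation: the component labels (`MAIN_EVENT`, `BACKGROUND`), the `boost` factor, and the term-count scoring are all hypothetical choices made for illustration, and the sketch assumes each document has already been split into labeled components by some structuring step.

```python
from collections import Counter


def score_document(components, query_terms, preferred, boost=2.0):
    """Score a document whose text is split into labeled components.

    components  -- list of (label, text) pairs; the labels are hypothetical
    query_terms -- iterable of query words
    preferred   -- set of labels judged more likely to hold what the query seeks
    boost       -- assumed multiplier for matches in preferred components
    """
    terms = {t.lower() for t in query_terms}
    score = 0.0
    for label, text in components:
        # Count whitespace-separated tokens; a real system would tokenize properly.
        counts = Counter(text.lower().split())
        matches = sum(counts[t] for t in terms)
        # Matches in preferred components count more than matches elsewhere.
        weight = boost if label in preferred else 1.0
        score += weight * matches
    return score


# Illustrative document with two hypothetical component labels.
doc = [
    ("MAIN_EVENT", "the council approved the budget"),
    ("BACKGROUND", "last year the budget failed and the budget was revised"),
]
```

With this toy document, a query for "budget" scores higher when `BACKGROUND` is the preferred component than when `MAIN_EVENT` is, because the background text mentions the term twice. Which components a given query should prefer is left open here.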