Paper: Natural Language Text Segmentation Techniques Applied To The Automatic Compilation Of Printed Subject Indexes And For Online Database Access

ACL ID A83-1023
Title Natural Language Text Segmentation Techniques Applied To The Automatic Compilation Of Printed Subject Indexes And For Online Database Access
Venue Applied Natural Language Processing Conference
Session Main Conference
Year 1983
Authors
  • G. Vladutz (Institute for Scientific Information, Philadelphia PA)

The nature of the problem and earlier approaches to the automatic compilation of printed subject indexes are reviewed and illustrated. A simple method is described for the detection of semantically self-contained word phrase segments in title-like texts. The method is based on a predetermined list of acceptable types of nominative syntactic patterns which can be recognized using a small domain-independent dictionary. The transformation of the detected word phrases into subject index records is described. The records are used for the compilation of Key Word Phrase subject indexes (KWPSI). The method has been successfully tested for the fully automatic production of KWPSI-type indexes to titles of scientific publications. The usage of KWPSI-type display formats for the enhanced online acce...
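
The general idea summarized in the abstract can be illustrated with a minimal sketch. It is not the author's implementation: the function-word dictionary, the one-letter tag set, the single acceptable pattern (content words optionally linked by "of"), and the record layout are all assumptions made for illustration, since the abstract does not list the actual patterns or the KWPSI record format.

```python
import re

# Hypothetical small, domain-independent dictionary: function words only.
# Any word not listed here is assumed to be a content word (tag "W").
FUNCTION_WORDS = {
    "of": "O",
    "in": "P", "on": "P", "for": "P", "by": "P", "with": "P",
    "to": "P", "from": "P", "at": "P", "as": "P",
    "and": "C", "or": "C",
    "the": "A", "a": "A", "an": "A",
}

# Assumed acceptable pattern over the tag string: one or more content
# words, optionally extended by "of" + content words.
ACCEPTABLE_PATTERN = re.compile(r"W+(?:OW+)*")


def find_phrases(title):
    """Return the word phrases in a title that match the acceptable pattern."""
    words = re.findall(r"[A-Za-z]+(?:-[A-Za-z]+)*", title.lower())
    # One tag character per word, so match offsets map directly to words.
    tags = "".join(FUNCTION_WORDS.get(word, "W") for word in words)
    return [
        " ".join(words[match.start():match.end()])
        for match in ACCEPTABLE_PATTERN.finditer(tags)
    ]


def index_records(title):
    """Build (heading word, phrase) records, one per content word of each phrase."""
    records = []
    for phrase in find_phrases(title):
        for word in phrase.split():
            if word not in FUNCTION_WORDS:
                records.append((word, phrase))
    return sorted(records)
```

Under these assumptions, `find_phrases("Detection of phrases in titles of scientific publications")` splits the title at the preposition "in" and yields two phrases, and `index_records` files each phrase under each of its content words. A fuller treatment would need the complete pattern list and dictionary, which the abstract does not reproduce.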