Paper: Document Classification By Machine: Theory And Practice

ACL ID C94-2172
Title Document Classification By Machine: Theory And Practice
Venue International Conference on Computational Linguistics
Session Main Conference
Year 1994
Authors

In this note, we present results concerning the theory and practice of determining for a given document which of several categories it best fits. We describe a mathematical model of classification schemes and the one scheme which can be proved optimal among all those based on word frequencies. Finally, we report the results of an experiment which illustrates the efficacy of this classification method.