Title Multi-document summarization using distortion-rate ratio
Venue Annual Meeting of the Association for Computational Linguistics
Session Main Conference
Year 2014

This work adapts the optimal tree pruning algorithm (BFOS), introduced by Breiman et al. (1984) and extended by Chou et al. (1989), to the multi-document summarization task. The BFOS algorithm is used to eliminate redundancy, which is one of the main issues in multi-document summarization. A Hierarchical Agglomerative Clustering (HAC) algorithm is employed to detect the redundancy. The tree built by the HAC algorithm is successively pruned with the optimal tree pruning algorithm to optimize the distortion vs. rate cost of the resultant tree. The rate parameter is defined as the number of sentences in the leaves of the tree. Distortion is the sum of the distances between the representative sentence of the cluster at each node and the other sentences in the same cluster. The sentences ...
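
To make the distortion and rate definitions above concrete, the following is a minimal Python sketch, not the authors' implementation. It assumes a precomputed pairwise sentence distance matrix, assumes the representative sentence of a cluster is its medoid (the abstract does not say how the representative is chosen), and assumes each leaf contributes one representative sentence, so the rate equals the number of leaves. The names `cluster_distortion`, `tree_cost`, and `pruning_slope` are hypothetical.

```python
from typing import List, Sequence, Tuple

Matrix = Sequence[Sequence[float]]


def cluster_distortion(dist: Matrix, members: Sequence[int]) -> Tuple[int, float]:
    """Return (representative, distortion) for one non-empty cluster.

    Assumption: the representative is the medoid, i.e. the member whose
    summed distance to the other members is smallest.
    """
    best_rep, best_cost = members[0], float("inf")
    for cand in members:
        # Sum of distances from the candidate to every other member.
        cost = sum(dist[cand][other] for other in members if other != cand)
        if cost < best_cost:
            best_rep, best_cost = cand, cost
    return best_rep, best_cost


def tree_cost(dist: Matrix, leaves: List[List[int]]) -> Tuple[float, int]:
    """Return (total distortion, rate) for a tree described by its leaf clusters.

    Assumption: one representative sentence per leaf, so rate = len(leaves).
    """
    distortion = sum(cluster_distortion(dist, leaf)[1] for leaf in leaves)
    return distortion, len(leaves)


def pruning_slope(before: Tuple[float, int], after: Tuple[float, int]) -> float:
    """Distortion added per unit of rate removed by a pruning step."""
    d_before, r_before = before
    d_after, r_after = after
    return (d_after - d_before) / (r_before - r_after)


# Hypothetical distances for four sentences: 0 and 1 are near-duplicates,
# as are 2 and 3.
dist = [
    [0.0, 0.2, 0.9, 0.8],
    [0.2, 0.0, 0.7, 0.9],
    [0.9, 0.7, 0.0, 0.1],
    [0.8, 0.9, 0.1, 0.0],
]

full_tree = tree_cost(dist, [[0], [1], [2], [3]])  # every sentence is a leaf
pruned_tree = tree_cost(dist, [[0, 1], [2, 3]])    # redundant pairs merged
slope = pruning_slope(full_tree, pruned_tree)
```

In this toy case, pruning halves the rate from 4 to 2 while adding only a small amount of distortion, which is the kind of trade-off a distortion-rate pruning criterion is meant to compare across candidate prunings.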