Paper: Using Document Summarization Techniques for Speech Data Subset Selection

ACL ID N13-1086
Title Using Document Summarization Techniques for Speech Data Subset Selection
Venue Annual Conference of the North American Chapter of the Association for Computational Linguistics
Session Main Conference
Year 2013
Authors

In this paper we leverage methods from sub- modular function optimization developed for document summarization and apply them to the problem of subselecting acoustic data. We evaluate our results on data subset selection for a phone recognition task. Our framework shows significant improvements over random selection and previously proposed methods us- ing a similar amount of resources.