Paper: Learning To Recognize Tables In Free Text

ACL ID P99-1057
Title Learning To Recognize Tables In Free Text
Venue Annual Meeting of the Association for Computational Linguistics
Session Main Conference
Year 1999

Many real-world texts contain tables. In order to process these texts correctly and extract the information contained within the tables, it is important to identify the presence and structure of tables. In this paper, we present a new approach that learns to recognize tables in free text, including the boundary, rows and columns of tables. When tested on Wall Street Journal news documents, our learning approach outperforms a deterministic table recognition algorithm that identifies tables based on a fixed set of conditions. Our learning approach is also more flexible and easily adaptable to texts in different domains with different table characteristics.