Paper: Degraded Text Recognition Using Word Collocation And Visual Inter-Word Constraints

ACL ID A94-1033
Title Degraded Text Recognition Using Word Collocation And Visual Inter-Word Constraints
Venue Applied Natural Language Processing Conference
Session Main Conference
Year 1994
Authors

Given a noisy text page, a word recognizer can generate a set of candidates for each word image. A relaxation algorithm was proposed previously by the authors that uses word collocation statistics to select the candidate for each word that has the highest probability of being the correct decision. Because word collocation is a lo- cal constraint and collocation data trained from corpora are usually incomplete, the algorithm cannot select the correct candidates for some images. To overcome this limitation, contextual information at the image level is now exploited inside the relaxation algorithm. If two word images can match with each other, they should have same symbolic identity. Visual inter-word rela- tions provide a way to link word images in the text and to interpret them systematical...