Paper: Efficient Parsing Strategies For Syntactic Analysis Of Closed Captions

ACL ID A00-3002
Title Efficient Parsing Strategies For Syntactic Analysis Of Closed Captions
Venue Annual Conference of the North American Chapter of the Association for Computational Linguistics
Session Student Session
Year 2000
Authors

We present an efficient multi-level chart parser designed for syntactic analysis of closed captions (subtitles) in a real-time Machine Translation (MT) system. To achieve high parsing speed, we divided an existing English grammar into multiple levels. The parser proceeds in stages, and at each stage only the rules belonging to one level are used. A constituent pruning step between levels ensures that constituents unlikely to be part of the final parse are removed. This significantly reduces both parse time and ambiguity. Since the domain is unrestricted, out-of-coverage sentences are to be expected, and the parser might not produce a single analysis spanning the whole input. Despite the incomplete parsing strategy and the radical pruning, the initia...
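The staged control flow described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the toy grammar, the part-of-speech tags, the function names (`match`, `apply_level`, `prune`, `parse`), and in particular the pruning heuristic (drop any constituent whose span lies strictly inside another constituent's span) are assumptions made only for illustration, since the abstract does not specify the pruning criterion.

```python
from typing import List, Set, Tuple

Edge = Tuple[str, int, int]          # (label, start, end), end is exclusive
Rule = Tuple[str, Tuple[str, ...]]   # (left-hand side, right-hand side)


def match(chart: Set[Edge], rhs: Tuple[str, ...], start: int) -> Set[int]:
    """Return the end positions reached by matching rhs as adjacent edges from start."""
    positions = {start}
    for symbol in rhs:
        positions = {e for (label, s, e) in chart
                     if label == symbol and s in positions}
    return positions


def apply_level(chart: Set[Edge], rules: List[Rule]) -> Set[Edge]:
    """Add every edge derivable with one level's rules, up to a fixed point."""
    chart = set(chart)
    changed = True
    while changed:
        changed = False
        for lhs, rhs in rules:
            for start in {s for (_, s, _) in chart}:
                for end in match(chart, rhs, start):
                    edge = (lhs, start, end)
                    if edge not in chart:
                        chart.add(edge)
                        changed = True
    return chart


def prune(chart: Set[Edge]) -> Set[Edge]:
    """Hypothetical heuristic: keep only edges not strictly inside another edge's span."""
    def covered(edge: Edge) -> bool:
        _, s, e = edge
        return any(s2 <= s and e <= e2 and (s2, e2) != (s, e)
                   for (_, s2, e2) in chart)
    return {edge for edge in chart if not covered(edge)}


def parse(tags: List[str], levels: List[List[Rule]]) -> Set[Edge]:
    """Run the levels in order, pruning the chart after each one."""
    chart: Set[Edge] = {(tag, i, i + 1) for i, tag in enumerate(tags)}
    for rules in levels:
        chart = prune(apply_level(chart, rules))
    return chart


# Toy three-level grammar (an assumption, not the grammar used in the paper).
LEVELS: List[List[Rule]] = [
    [("NP", ("DT", "NN"))],   # level 1: noun phrases
    [("VP", ("VB", "NP"))],   # level 2: verb phrases
    [("S", ("NP", "VP"))],    # level 3: sentences
]

full = parse(["DT", "NN", "VB", "DT", "NN"], LEVELS)
partial = parse(["DT", "NN", "UH"], LEVELS)
```

With this toy grammar, `full` contains a single `S` edge spanning all five tokens, while `partial` keeps an `NP` over the first two tokens and the uncovered `UH` tag as separate fragments, which mirrors the abstract's point that out-of-coverage input may yield no single spanning analysis. Because each level sees only the survivors of the previous pruning step, the chart stays small, at the cost of possibly discarding a constituent that a later level would have needed.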