Paper: SHOGUN - Multilingual Data Extraction For TIPSTER

ACL ID H93-1089
Title SHOGUN - Multilingual Data Extraction For TIPSTER
Venue Human Language Technologies
Session Main Conference
Year 1993
Authors
  • Paul S. Jacobs (GE Corporate Research & Development, Schenectady NY)

rules. While the performance on all tasks still lags behind hu- man analysts, closing this gap may not be as hard as we first expected. Much of the difference comes from portions of the work that are still incomplete. In ad- dition, the ability to use automatically-acquired corpus data gives the programs a distinct advantage on certain portions of the task. PLANS FOR THE COMING YEAR As the project nears completion, the team is approach- ing the goal of near-human accuracy mostly by finishing certain key details, such as better reference resolution and word sense discrimination. At the same time, we are close to some significant advances in corpus-based training methods that will not only isolate the context required to discriminate nuances of meaning but also sig- nificantly reduce develop...