Paper: Combination Of N-Grams And Stochastic Context-Free Grammars For Language Modeling

ACL ID C00-1009
Title Combination Of N-Grams And Stochastic Context-Free Grammars For Language Modeling
Venue International Conference on Computational Linguistics
Session Main Conference
Year 2000
Authors

This t)al)t;r de, scribes a hybrid prol)osal to combine n-grams and Stochastic Context-Free Grammars (SCFGs) tbr language modeling. A classical n-gram model is used to cat)lure the local relations between words, while a stochas- tic grammatical inodel is considered to repre- sent the hmg-term relations between syntactical stru(:tm'es. In order to define this granmlatical model, which will 1)e used on large-vo(:almlary comph'~x tasks, a eategory-t)ased SCFG and a prol)abilisti(" model of' word (tistrilmtion in the categories have been 1)rol)osed. Methods for leanfing these stochastic models tTor complex tasks are described, and algorithms for con> puting the word transition probal)ilities are also 1)resented. Filmily, ext)erilnents using the Penn Treel)ank corpus improved by 30% the test; s...