Paper: Lightweight Client-Side Chinese/Japanese Morphological Analyzer Based on Online Learning

ACL ID C14-2009
Title Lightweight Client-Side Chinese/Japanese Morphological Analyzer Based on Online Learning
Venue International Conference on Computational Linguistics
Session Main Conference
Year 2014
Authors

As mobile devices and Web applications become popular, lightweight, client-side language analysis is more important than ever. We propose Rakuten MA, a Chinese/Japanese morphological analyzer written in JavaScript. It employs an online learning algorithm SCW, which enables client-side model update and domain adaptation. We have achieved a compact model size (5MB) while maintaining the state-of-the-art performance, via techniques such as feature hashing, FOBOS, and feature quantization.