Paper: TSUBAKI: An Open Search Engine Infrastructure for Developing New Information Access Methodology

ACL ID I08-1025
Title TSUBAKI: An Open Search Engine Infrastructure for Developing New Information Access Methodology
Venue International Joint Conference on Natural Language Processing
Session Main Conference
Year 2008
Authors

As the amount of information created by human beings is explosively grown in the last decade, it is getting extremely harder to obtain necessary information by conven- tional information access methods. Hence, creation of drastically new technology is needed. For developing such new technol- ogy, search engine infrastructures are re- quired. Although the existing search engine APIscanberegardedassuchinfrastructures, theseAPIshaveseveralrestrictionssuchasa limit on the number of API calls. To help the development of new technology, we are run- ning an open search engine infrastructure, TSUBAKI, on a high-performance comput- ing environment. In this paper, we describe TSUBAKI infrastructure.