Paper: Inducing Search Keys for Name Filtering

ACL ID D07-1095
Title Inducing Search Keys for Name Filtering
Venue Conference on Empirical Methods in Natural Language Processing
Session Main Conference
Year 2007

This paper describes ETK (Ensemble of Transformation-based Keys), a new algorithm for inducing search keys for name filtering. ETK has the low computational cost and the ability to filter by phonetic similarity that are characteristic of phonetic keys such as Soundex, but it is adaptable to alternative similarity models. The accuracy of ETK in a preliminary empirical evaluation suggests that it is well-suited for phonetic filtering applications such as recognizing alternative cross-lingual transliterations.
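
To make the notion of a phonetic search key concrete, the sketch below shows key-based name filtering with standard American Soundex, the baseline named in the abstract. It is not ETK: the abstract does not specify how ETK induces its keys, so nothing here reproduces that step. The function names `soundex`, `build_index`, and `candidates` are hypothetical, chosen only for this illustration, and the input is assumed to be names in ASCII letters.

```python
from collections import defaultdict

# Standard American Soundex consonant classes (assumption: ASCII input only).
_CODES = {}
for digit, letters in {"1": "bfpv", "2": "cgjkqsxz", "3": "dt",
                       "4": "l", "5": "mn", "6": "r"}.items():
    for ch in letters:
        _CODES[ch] = digit


def soundex(name):
    """Return the 4-character Soundex key for a name ("" if it has no letters)."""
    letters = [c for c in name.lower() if c.isascii() and c.isalpha()]
    if not letters:
        return ""
    key = letters[0].upper()
    prev = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not break a run of identical codes
        code = _CODES.get(ch, "")
        if code and code != prev:
            key += code
        prev = code  # a vowel resets prev, so a repeated code after it is kept
        if len(key) == 4:
            break
    return key.ljust(4, "0")


def build_index(names):
    """Group names by key so a lookup is a single dictionary access."""
    index = defaultdict(list)
    for name in names:
        index[soundex(name)].append(name)
    return index


def candidates(index, query):
    """Return the stored names that share the query's key."""
    return index.get(soundex(query), [])


index = build_index(["Robert", "Rupert", "Rubin", "Ashcraft"])
print(candidates(index, "Robart"))  # ['Robert', 'Rupert']
```

The low cost mentioned in the abstract comes from this structure: each name is reduced to a key once, and filtering is an exact-match lookup on that key rather than a pairwise similarity computation. The drawback of a fixed key such as Soundex is that its notion of similarity is hard-coded, which is the limitation the abstract says ETK addresses by being adaptable to alternative similarity models.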