Paper: Using idiolects and sociolects to improve word prediction

ACL ID: E14-1034
Title: Using idiolects and sociolects to improve word prediction
Venue: Annual Meeting of The European Chapter of The Association of Computational Linguistics
Session: Main Conference
Year: 2014

In this paper the word prediction system Soothsayer is described. This system predicts what a user is going to write as he is keying it in. The main innovation of Soothsayer is that it not only uses idiolects, the language of one individual person, as its source of knowledge, but also sociolects, the language of the social circle around that person. We use Twitter for data collection and experimentation. The idiolect models are based on individual Twitter feeds; the sociolect models are based on the tweets of a particular person and the tweets of the people he often communicates with. The idea behind this is that people who often communicate start to talk alike; therefore the language of the friends of person x can be helpful in trying to predict what person x is going to s...
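
The idea of combining an idiolect model with a sociolect model can be illustrated with a minimal sketch. This is not Soothsayer's implementation: the bigram counts, the function names `train_bigrams` and `predict`, the toy tweets, and the choice to consult the idiolect first and fall back to the sociolect only when it offers no candidate are all assumptions made for illustration. The only detail taken from the abstract is that the sociolect model is built from the person's own tweets together with those of the people he communicates with.

```python
from collections import Counter, defaultdict


def train_bigrams(texts):
    """Count which word follows which in a list of texts (e.g. tweets)."""
    model = defaultdict(Counter)
    for text in texts:
        words = text.lower().split()
        # Pair each word with its predecessor; "<s>" marks the start of a text.
        for prev, nxt in zip(["<s>"] + words, words):
            model[prev][nxt] += 1
    return model


def predict(prev_word, typed_prefix, idiolect, sociolect):
    """Suggest the word being typed, given the previous word and the prefix so far.

    Assumed strategy: try the idiolect first, then fall back to the sociolect.
    """
    for model in (idiolect, sociolect):
        candidates = {
            word: count
            for word, count in model.get(prev_word, {}).items()
            if word.startswith(typed_prefix)
        }
        if candidates:
            # Most frequent continuation; ties are broken alphabetically.
            return min(candidates, key=lambda w: (-candidates[w], w))
    return None


# Hypothetical toy data standing in for Twitter feeds.
own_tweets = ["see you at the gym", "see you tomorrow"]
friends_tweets = ["see you at the game tonight", "the game was great"]

idiolect = train_bigrams(own_tweets)
sociolect = train_bigrams(own_tweets + friends_tweets)

print(predict("the", "g", idiolect, sociolect))   # gym  (found in the idiolect)
print(predict("the", "ga", idiolect, sociolect))  # game (only the sociolect knows it)
```

In the second call the user's own tweets contain no word starting with "ga" after "the", so the suggestion comes from the friends' language, which is the situation the abstract describes as the motivation for sociolect models.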