Paper: Natural Language Questions for the Web of Data

ACL ID D12-1035
Title Natural Language Questions for the Web of Data
Venue Conference on Empirical Methods in Natural Language Processing
Session Main Conference
Year 2012

The Linked Data initiative comprises struc- tured databases in the Semantic-Web data model RDF. Exploring this heterogeneous data by structured query languages is tedious and error-prone even for skilled users. To ease the task, this paper presents a methodology for translating natural language questions into structured SPARQL queries over linked-data sources. Our method is based on an integer linear pro- gram to solve several disambiguation tasks jointly: the segmentation of questions into phrases; the mapping of phrases to semantic entities, classes, and relations; and the con- struction of SPARQL triple patterns. Our so- lution harnesses the rich type system provided by knowledge bases in the web of linked data, to constrain our semantic-coherence objective function. We present experime...