research-article

Robust question answering over the web of linked data

Authors:
Mohamed Yahya

Max Planck Institute for Informatics, Saarbruecken, Germany

Max Planck Institute for Informatics, Saarbruecken, Germany
View Profile

,
Klaus Berberich

Max Planck Institute for Informatics, Saarbruecken, Germany

Max Planck Institute for Informatics, Saarbruecken, Germany
View Profile

,
Shady Elbassuoni

American University of Beirut, Beirut, Lebanon

American University of Beirut, Beirut, Lebanon
View Profile

,
Gerhard Weikum

Max Planck Institute for Informatics, Saarbruecken, Germany

Max Planck Institute for Informatics, Saarbruecken, Germany
View Profile

CIKM '13: Proceedings of the 22nd ACM international conference on Information & Knowledge ManagementOctober 2013Pages 1107–1116https://doi.org/10.1145/2505515.2505677

Published:27 October 2013Publication History

CIKM '13: Proceedings of the 22nd ACM international conference on Information & Knowledge Management

Pages 1107–1116

ABSTRACT

Knowledge bases and the Web of Linked Data have become important assets for search, recommendation, and analytics. Natural-language questions are a user-friendly mode of tapping this wealth of knowledge and data. However, question answering technology does not work robustly in this setting as questions have to be translated into structured queries and users have to be careful in phrasing their questions. This paper advocates a new approach that allows questions to be partially translated into relaxed queries, covering the essential but not necessarily all aspects of the user's input. To compensate for the omissions, we exploit textual sources associated with entities and relational facts. Our system translates user questions into an extended form of structured SPARQL queries, with text predicates attached to triple patterns. Our solution is based on a novel optimization model, cast into an integer linear program, for joint decomposition and disambiguation of the user question. We demonstrate the quality of our methods through experiments with the QALD benchmark.

References

Sanjay Agrawal, Surajit Chaudhuri, and Gautam Das DBXplorer: A System for Keyword-Based Search over Relational Databases. In ICDE, 2002.Google ScholarCross Ref
Nitish Aggarwal. Cross Lingual Semantic Search by Improving Semantic Similarity and Relatedness Measures. In ISWC, 2012. Google ScholarDigital Library
Sören Auer, Christian Bizer, Georgi Kobilarov, Jens Lehmann, Richard Cyganiak, and Zachary G. Ives. DBpedia: A Nucleus for a Web of Open Data. In ISWC, 2007. Google ScholarDigital Library
Krisztian Balog, Leif Azzopardi, and Maarten de Rijke. A language modeling framework for expert finding. Inf. Process. Manage. 45(1):1--19. Google ScholarDigital Library
Krisztian Balog, Leif Azzopardi, and Maarten de Rijke. Overview of the TREC 2011 Entity Track. In TREC, 2011.Google Scholar
Krisztian Balog, Yi Fang, Maarten de Rijke, Pavel Serdyukov, and Luo Si. Expertise Retrieval. Foundations and Trends in Information Retrieval 6(2--3):127--256. Google ScholarDigital Library
Gaurav Bhalotia, Arvind Hulgeri, Charuta Nakhe, Soumen Chakrabarti, S. Sudarshan Keyword Searching and Browsing in Databases using BANKS. In ICDE, 2002.Google ScholarCross Ref
Stefan Büttcher, Charles L. A. Clarke, and Gordon V. Cormack. Information Retrieval: Implementing and Evaluating Search Engines. MIT Press, 2010. Google ScholarDigital Library
Elena Cabrio, Julien Cojan, Alessio Palmero Aprosio, Bernardo Magnini, Alberto Lavelli, and Fabien Gandon. QAKiS: an Open Domain QA System based on Relational Patterns. In ILD, 2012.Google Scholar
Jennifer Chu-Carroll, James Fan, Branimir Boguraev, David Carmel, Dafna Sheinwald, and Chris Welty.Welty, C. Finding needles in the haystack: Search and candidate generation. IBM Journal of Research and Development 56(3):6. Google ScholarDigital Library
Marek Ciglan, Kjetil Nørvåg, and Ladislav Hluchý. The SemSets model for ad-hoc semantic list search. In WWW, 2012. Google ScholarDigital Library
Hoa Trang Dang, Diane Kelly, and Jimmy J. Lin. Overview of the TREC 2007 question answering track. In TREC, 2007.Google Scholar
Shady Elbassuoni, Maya Ramanath, Ralf Schenkel, Marcin Sydow, and Gerhard Weikum. Language-model-based ranking for queries on rdf-graphs. In CIKM, 2009. Google ScholarDigital Library
Shady Elbassuoni, Maya Ramanath, and Gerhard Weikum. Query relaxation for entity-relationship search. In ESWC, 2011. Google ScholarDigital Library
Anette Frank, Hans-Ulrich Krieger, Feiyu Xu, Hans Uszkoreit, Berthold Crysmann, Brigitte Jörg, and Ulrich Schäfer. Question Answering from Structured Knowledge Sources. Journal of Applied Logic 5(1):20--48.Google ScholarCross Ref
Anthony Fader, Stephen Soderland, and Oren Etzioni. Identifying relations for open information extraction. In EMNLP, 2011. Google ScholarDigital Library
Hui Fang and ChengXiang Zhai. Probabilistic Models for Expert Finding. In ECIR, 2007. Google ScholarDigital Library
Vagelis Hristidis and Yannis Papakonstantinou. DISCOVER: Keyword Search in Relational Databases. In VLDB, 2002. Google ScholarDigital Library
Vagelis Hristidis, Heasoo Hwang, and Yannis Papakonstantinou. Authority-based keyword search in databases. ACM Trans. Database Syst. 33(1):1--40. Google ScholarDigital Library
Hao He, Haixun Wang, Jun Yang, and Philip S. Yu. BLINKS: ranked keyword searches on graphs. In SIGMOD, 2007. Google ScholarDigital Library
Heath, T., and Bizer, C. Linked Data: Evolving the Web into a Global Data Space. Morgan & Claypool Publishers, 2011. Google ScholarDigital Library
Johannes Hoffart, Fabian M. Suchanek, Klaus Berberich, and Gerhard Weikum. YAGO2: A spatially and temporally enhanced knowledge base from Wikipedia. Artificial Intelligence 194: 28--61. Google ScholarDigital Library
IBM 2012. Special Issue on "This is Watson" IBM Journal of Research and Development 56(3/4).Google Scholar
Aditya Kalyanpur, J. William Murdock, James Fan, and Christopher A. Welty. Leveraging community-built knowledge for type coercion in question answering. In ISWC, 2011. Google ScholarDigital Library
Boris Katz, Sue Felshin, Gregory Marton, Federico Mora, Yuan Kui Shen, Gabriel Zaccak, Ammar Ammar, Eric Eisner, Asli Turgut, and L. Brown Westrick: CSAIL at TREC 2007 Question Answering. In TREC, 2007.Google Scholar
Cody C. T. Kwok, Oren Etzioni, and Daniel S. Weld. Scaling Question Answering to the Web. In WWW, 2001. Google ScholarDigital Library
Lin, J. J.; An exploration of the principles underlying redundancy-based factoid question answering. ACM Transactions on Information Systems 25(2):1--48. Google ScholarDigital Library
Christopher D. Manning, Prabhakar Raghavan, and Hinrich Schütze. Introduction to Information Retrieval. Cambridge University Press, 2008. Google ScholarDigital Library
Marie-Catherine de Marneffe, Bill MacCartney, and Christopher D. Manning. Generating typed dependency parses from phrase structure parses. In LREC, 2006.Google Scholar
Ndapandula Nakashole, Gerhard Weikum, and Fabian M. Suchanek PATTY: A Taxonomy of Relational Patterns with Semantic Types. In EMNLP, 2012. Google ScholarDigital Library
Zaiqing Nie, Yunxiao Ma, Shuming Shi, Ji-Rong Wen, and Wei-Ying Ma. Web object retrieval. In WWW, 2007. Google ScholarDigital Library
Anselmo Penas, Eduard H. Hovy, Pamela Forner, Álvaro Rodrigo, Richard F. E. Sutcliffe, Caroline Sporleder, Corina Forascu, Yassine Benajiba, and Petya Osenova. Overview of QA4MRE at CLEF 2012: Question Answering for Machine Reading Evaluation. In CLEF Evaluation Labs and Workshop, 2012.Google Scholar
Jeffrey Pound, Ihab F. Ilyas, and Grant E. Weddell. 2010. Expressive and Flexible Access to Web-extracted Data: A Keyword-based Structured Query Language. In SIGMOD, 2010. Google ScholarDigital Library
Jeffrey Pound, Alexander K. Hudek, Ihab F. Ilyas, and Grant E. Weddell. Interpreting keyword queries over web knowledge bases. In CIKM, 2012. Google ScholarDigital Library
Second Workshop on Question Answering over Linked Data (QALD-2).2012. http://www.sc.cit-ec.uni-bielefeld.de/qald-2.Google Scholar
Deepak Ravichandran and Eduard H. Hovy. Learning surface text patterns for a Question Answering System. In ACL, 2002. Google ScholarDigital Library
Uma Sawant and Soumen Chakrabarti Learning Joint Query Interpretation and Response Ranking. CoRR abs/1212.6193Google Scholar
Pavel Serdyukov, Henning Rode, and Djoerd Hiemstra. Modeling expert finding as an absorbing random walk In SIGIR, 2008. Google ScholarDigital Library
Fabian M. Suchanek, Gjergji Kasneci, and Gerhard Weikum. Yago: a core of semantic knowledge. In WWW, 2007. Google ScholarDigital Library
Christina Unger, Lorenz Bühmann, Jens Lehmann, Axel-Cyrille Ngonga Ngomo, Daniel Gerber, and Philipp Cimiano. Template-based question answering over RDF data. In WWW, 2012. Google ScholarDigital Library
Christina Unger, Philipp Cimiano, Vanessa Lopez, Enrico Motta, Paul Buitelaar, and Richard Cyganiak. ILD 2012. http://ceur-ws.org/Vol-913/.Google Scholar
David Vallet and Hugo Zaragoza. Inferring the most important types of a query: a semantic approach. In SIGIR, 2008. Google ScholarDigital Library
Ellen M. Voorhees. Overview of the TREC 2003 question answering track. In TREC, 2003.Google Scholar
Sebastian Walter, Christina Unger, Philipp Cimiano, and Daniel Bär. Evaluation of a Layered Approach to Question Answering over Linked Data In ISWC, 2012. Google ScholarDigital Library
Qiuyue Wang, Jaap Kamps, Georgina Ramirez Camps, Maarten Marx, Anne Schuth, Martin Theobald, Sairam Gurajada, and Arunav Mishra. Overview of the INEX 2012 Linked Data Track. In CLEF (Online Working Notes/Labs/Workshop), 2012.Google Scholar
Mohamed Yahya, Klaus Berberich, Shady Elbassuoni, Maya Ramanath, Volker Tresp, and Gerhard Weikum. Natural Language Questions for the Web of Data. In EMNLP, 2012. Google ScholarDigital Library
Jeffrey Xu Yu, Lu Qin, and Lijun Chang phKeyword Search in Databases. Synthesis Lectures on Data Management. Morgan & Claypool Publishers, 2010. Google ScholarDigital Library
ChengXiang Zhai. phStatistical Language Models for Information Retrieval. Synthesis Lectures on Data Management. Morgan & Claypool Publishers, 2008. Google ScholarDigital Library

Index Terms

Robust question answering over the web of linked data

Recommendations

Template-based question answering over RDF data
WWW '12: Proceedings of the 21st international conference on World Wide Web

As an increasing amount of RDF data is published as Linked Data, intuitive ways of accessing this data become more and more important. Question answering approaches have been proposed as a good compromise between intuitiveness and expressivity. Most ...
Read More
Deep answers for naturally asked questions on the web of data
WWW '12 Companion: Proceedings of the 21st International Conference on World Wide Web

We present DEANNA, a framework for natural language question answering over structured knowledge bases. Given a natural language question, DEANNA translates questions into a structured SPARQL query that can be evaluated over knowledge bases such as Yago,...
Read More
Semantic question answering system over linked data using relational patterns
EDBT '13: Proceedings of the Joint EDBT/ICDT 2013 Workshops

Question answering is the task of answering questions in natural language. Linked Data project and Semantic Web community made it possible for us to query structured knowledge bases like DBpedia and YAGO. Only expert users, however, with the knowledge ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
CIKM '13: Proceedings of the 22nd ACM international conference on Information & Knowledge Management
October 2013
2612 pages
ISBN:9781450322638
DOI:10.1145/2505515
General Chairs:
Qi He
LinkedIn, USA
,
Arun Iyengar
IBM T.J. Watson Research Center, USA
,
Program Chairs:
Wolfgang Nejdl
L3S Research Center, Germany
,
Jian Pei
Simon Fraser University, Canada
,
Rajeev Rastogi
Amazon, India
Copyright © 2013 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 27 October 2013
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
disambiguation
knowledge base
question answering
semantic search
usability
Qualifiers
- research-article
Conference

Acceptance Rates
CIKM '13 Paper Acceptance Rate143of848submissions,17%Overall Acceptance Rate1,861of8,427submissions,22%
More
Upcoming Conference
CIKM '24

Sponsor:

sigir

sigir

The 33rd ACM International Conference on Information and Knowledge Management

October 21 - 25, 2024

Boise , ID , USA
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 52
  Total Citations
  View Citations
- 601
  Total Downloads
- Downloads (Last 12 months)13
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Robust question answering over the web of linked data

CIKM '13: Proceedings of the 22nd ACM international conference on Information & Knowledge Management

ABSTRACT

References

Cited By

Index Terms

Recommendations

Template-based question answering over RDF data

Deep answers for naturally asked questions on the web of data

Semantic question answering system over linked data using relational patterns