ABSTRACT
Knowledge bases and the Web of Linked Data have become important assets for search, recommendation, and analytics. Natural-language questions are a user-friendly mode of tapping this wealth of knowledge and data. However, question answering technology does not work robustly in this setting as questions have to be translated into structured queries and users have to be careful in phrasing their questions. This paper advocates a new approach that allows questions to be partially translated into relaxed queries, covering the essential but not necessarily all aspects of the user's input. To compensate for the omissions, we exploit textual sources associated with entities and relational facts. Our system translates user questions into an extended form of structured SPARQL queries, with text predicates attached to triple patterns. Our solution is based on a novel optimization model, cast into an integer linear program, for joint decomposition and disambiguation of the user question. We demonstrate the quality of our methods through experiments with the QALD benchmark.
- Sanjay Agrawal, Surajit Chaudhuri, and Gautam Das DBXplorer: A System for Keyword-Based Search over Relational Databases. In ICDE, 2002.Google ScholarCross Ref
- Nitish Aggarwal. Cross Lingual Semantic Search by Improving Semantic Similarity and Relatedness Measures. In ISWC, 2012. Google ScholarDigital Library
- Sören Auer, Christian Bizer, Georgi Kobilarov, Jens Lehmann, Richard Cyganiak, and Zachary G. Ives. DBpedia: A Nucleus for a Web of Open Data. In ISWC, 2007. Google ScholarDigital Library
- Krisztian Balog, Leif Azzopardi, and Maarten de Rijke. A language modeling framework for expert finding. Inf. Process. Manage. 45(1):1--19. Google ScholarDigital Library
- Krisztian Balog, Leif Azzopardi, and Maarten de Rijke. Overview of the TREC 2011 Entity Track. In TREC, 2011.Google Scholar
- Krisztian Balog, Yi Fang, Maarten de Rijke, Pavel Serdyukov, and Luo Si. Expertise Retrieval. Foundations and Trends in Information Retrieval 6(2--3):127--256. Google ScholarDigital Library
- Gaurav Bhalotia, Arvind Hulgeri, Charuta Nakhe, Soumen Chakrabarti, S. Sudarshan Keyword Searching and Browsing in Databases using BANKS. In ICDE, 2002.Google ScholarCross Ref
- Stefan Büttcher, Charles L. A. Clarke, and Gordon V. Cormack. Information Retrieval: Implementing and Evaluating Search Engines. MIT Press, 2010. Google ScholarDigital Library
- Elena Cabrio, Julien Cojan, Alessio Palmero Aprosio, Bernardo Magnini, Alberto Lavelli, and Fabien Gandon. QAKiS: an Open Domain QA System based on Relational Patterns. In ILD, 2012.Google Scholar
- Jennifer Chu-Carroll, James Fan, Branimir Boguraev, David Carmel, Dafna Sheinwald, and Chris Welty.Welty, C. Finding needles in the haystack: Search and candidate generation. IBM Journal of Research and Development 56(3):6. Google ScholarDigital Library
- Marek Ciglan, Kjetil Nørvåg, and Ladislav Hluchý. The SemSets model for ad-hoc semantic list search. In WWW, 2012. Google ScholarDigital Library
- Hoa Trang Dang, Diane Kelly, and Jimmy J. Lin. Overview of the TREC 2007 question answering track. In TREC, 2007.Google Scholar
- Shady Elbassuoni, Maya Ramanath, Ralf Schenkel, Marcin Sydow, and Gerhard Weikum. Language-model-based ranking for queries on rdf-graphs. In CIKM, 2009. Google ScholarDigital Library
- Shady Elbassuoni, Maya Ramanath, and Gerhard Weikum. Query relaxation for entity-relationship search. In ESWC, 2011. Google ScholarDigital Library
- Anette Frank, Hans-Ulrich Krieger, Feiyu Xu, Hans Uszkoreit, Berthold Crysmann, Brigitte Jörg, and Ulrich Schäfer. Question Answering from Structured Knowledge Sources. Journal of Applied Logic 5(1):20--48.Google ScholarCross Ref
- Anthony Fader, Stephen Soderland, and Oren Etzioni. Identifying relations for open information extraction. In EMNLP, 2011. Google ScholarDigital Library
- Hui Fang and ChengXiang Zhai. Probabilistic Models for Expert Finding. In ECIR, 2007. Google ScholarDigital Library
- Vagelis Hristidis and Yannis Papakonstantinou. DISCOVER: Keyword Search in Relational Databases. In VLDB, 2002. Google ScholarDigital Library
- Vagelis Hristidis, Heasoo Hwang, and Yannis Papakonstantinou. Authority-based keyword search in databases. ACM Trans. Database Syst. 33(1):1--40. Google ScholarDigital Library
- Hao He, Haixun Wang, Jun Yang, and Philip S. Yu. BLINKS: ranked keyword searches on graphs. In SIGMOD, 2007. Google ScholarDigital Library
- Heath, T., and Bizer, C. Linked Data: Evolving the Web into a Global Data Space. Morgan & Claypool Publishers, 2011. Google ScholarDigital Library
- Johannes Hoffart, Fabian M. Suchanek, Klaus Berberich, and Gerhard Weikum. YAGO2: A spatially and temporally enhanced knowledge base from Wikipedia. Artificial Intelligence 194: 28--61. Google ScholarDigital Library
- IBM 2012. Special Issue on "This is Watson" IBM Journal of Research and Development 56(3/4).Google Scholar
- Aditya Kalyanpur, J. William Murdock, James Fan, and Christopher A. Welty. Leveraging community-built knowledge for type coercion in question answering. In ISWC, 2011. Google ScholarDigital Library
- Boris Katz, Sue Felshin, Gregory Marton, Federico Mora, Yuan Kui Shen, Gabriel Zaccak, Ammar Ammar, Eric Eisner, Asli Turgut, and L. Brown Westrick: CSAIL at TREC 2007 Question Answering. In TREC, 2007.Google Scholar
- Cody C. T. Kwok, Oren Etzioni, and Daniel S. Weld. Scaling Question Answering to the Web. In WWW, 2001. Google ScholarDigital Library
- Lin, J. J.; An exploration of the principles underlying redundancy-based factoid question answering. ACM Transactions on Information Systems 25(2):1--48. Google ScholarDigital Library
- Christopher D. Manning, Prabhakar Raghavan, and Hinrich Schütze. Introduction to Information Retrieval. Cambridge University Press, 2008. Google ScholarDigital Library
- Marie-Catherine de Marneffe, Bill MacCartney, and Christopher D. Manning. Generating typed dependency parses from phrase structure parses. In LREC, 2006.Google Scholar
- Ndapandula Nakashole, Gerhard Weikum, and Fabian M. Suchanek PATTY: A Taxonomy of Relational Patterns with Semantic Types. In EMNLP, 2012. Google ScholarDigital Library
- Zaiqing Nie, Yunxiao Ma, Shuming Shi, Ji-Rong Wen, and Wei-Ying Ma. Web object retrieval. In WWW, 2007. Google ScholarDigital Library
- Anselmo Penas, Eduard H. Hovy, Pamela Forner, Álvaro Rodrigo, Richard F. E. Sutcliffe, Caroline Sporleder, Corina Forascu, Yassine Benajiba, and Petya Osenova. Overview of QA4MRE at CLEF 2012: Question Answering for Machine Reading Evaluation. In CLEF Evaluation Labs and Workshop, 2012.Google Scholar
- Jeffrey Pound, Ihab F. Ilyas, and Grant E. Weddell. 2010. Expressive and Flexible Access to Web-extracted Data: A Keyword-based Structured Query Language. In SIGMOD, 2010. Google ScholarDigital Library
- Jeffrey Pound, Alexander K. Hudek, Ihab F. Ilyas, and Grant E. Weddell. Interpreting keyword queries over web knowledge bases. In CIKM, 2012. Google ScholarDigital Library
- Second Workshop on Question Answering over Linked Data (QALD-2).2012. http://www.sc.cit-ec.uni-bielefeld.de/qald-2.Google Scholar
- Deepak Ravichandran and Eduard H. Hovy. Learning surface text patterns for a Question Answering System. In ACL, 2002. Google ScholarDigital Library
- Uma Sawant and Soumen Chakrabarti Learning Joint Query Interpretation and Response Ranking. CoRR abs/1212.6193Google Scholar
- Pavel Serdyukov, Henning Rode, and Djoerd Hiemstra. Modeling expert finding as an absorbing random walk In SIGIR, 2008. Google ScholarDigital Library
- Fabian M. Suchanek, Gjergji Kasneci, and Gerhard Weikum. Yago: a core of semantic knowledge. In WWW, 2007. Google ScholarDigital Library
- Christina Unger, Lorenz Bühmann, Jens Lehmann, Axel-Cyrille Ngonga Ngomo, Daniel Gerber, and Philipp Cimiano. Template-based question answering over RDF data. In WWW, 2012. Google ScholarDigital Library
- Christina Unger, Philipp Cimiano, Vanessa Lopez, Enrico Motta, Paul Buitelaar, and Richard Cyganiak. ILD 2012. http://ceur-ws.org/Vol-913/.Google Scholar
- David Vallet and Hugo Zaragoza. Inferring the most important types of a query: a semantic approach. In SIGIR, 2008. Google ScholarDigital Library
- Ellen M. Voorhees. Overview of the TREC 2003 question answering track. In TREC, 2003.Google Scholar
- Sebastian Walter, Christina Unger, Philipp Cimiano, and Daniel Bär. Evaluation of a Layered Approach to Question Answering over Linked Data In ISWC, 2012. Google ScholarDigital Library
- Qiuyue Wang, Jaap Kamps, Georgina Ramirez Camps, Maarten Marx, Anne Schuth, Martin Theobald, Sairam Gurajada, and Arunav Mishra. Overview of the INEX 2012 Linked Data Track. In CLEF (Online Working Notes/Labs/Workshop), 2012.Google Scholar
- Mohamed Yahya, Klaus Berberich, Shady Elbassuoni, Maya Ramanath, Volker Tresp, and Gerhard Weikum. Natural Language Questions for the Web of Data. In EMNLP, 2012. Google ScholarDigital Library
- Jeffrey Xu Yu, Lu Qin, and Lijun Chang phKeyword Search in Databases. Synthesis Lectures on Data Management. Morgan & Claypool Publishers, 2010. Google ScholarDigital Library
- ChengXiang Zhai. phStatistical Language Models for Information Retrieval. Synthesis Lectures on Data Management. Morgan & Claypool Publishers, 2008. Google ScholarDigital Library
Index Terms
- Robust question answering over the web of linked data
Recommendations
Template-based question answering over RDF data
WWW '12: Proceedings of the 21st international conference on World Wide WebAs an increasing amount of RDF data is published as Linked Data, intuitive ways of accessing this data become more and more important. Question answering approaches have been proposed as a good compromise between intuitiveness and expressivity. Most ...
Deep answers for naturally asked questions on the web of data
WWW '12 Companion: Proceedings of the 21st International Conference on World Wide WebWe present DEANNA, a framework for natural language question answering over structured knowledge bases. Given a natural language question, DEANNA translates questions into a structured SPARQL query that can be evaluated over knowledge bases such as Yago,...
Semantic question answering system over linked data using relational patterns
EDBT '13: Proceedings of the Joint EDBT/ICDT 2013 WorkshopsQuestion answering is the task of answering questions in natural language. Linked Data project and Semantic Web community made it possible for us to query structured knowledge bases like DBpedia and YAGO. Only expert users, however, with the knowledge ...
Comments