Answer Type Identification for Question Answering

Walker, Andrew D.; Alexopoulos, Panos; Starkey, Andrew; Pan, Jeff Z.; Gómez-Pérez, José Manuel; Siddharthan, Advaith

doi:10.1007/978-3-319-31676-5_17

Andrew D. Walker¹⁷,
Panos Alexopoulos¹⁸,
Andrew Starkey¹⁷,
Jeff Z. Pan¹⁷,
José Manuel Gómez-Pérez¹⁸ &
…
Advaith Siddharthan¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9544))

Included in the following conference series:

Joint International Semantic Technology Conference

891 Accesses
2 Citations

Abstract

Question Answering research has long recognised that the identification of the type of answer being requested is a fundamental step in the interpretation of a question as a whole. Previous strategies have ranged from trivial keyword matches, to statistical analyses, to well-defined algorithms based on shallow syntactic parses with user-interaction for ambiguity resolution. A novel strategy combining deep NLP on both syntactic and dependency parses with supervised learning is introduced and results that improve on extant alternatives reported. The impact of the strategy on QALD is also evaluated with a proprietary Question Answering system and its positive results analysed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
A pre-pre-terminal is a node for which every child is a pre-terminal. A pre-terminal is a node with a single child which is itself a leaf.
2.
A word’s root form without any morphological indications of tense, number, mood etc. E.g., the lemma of ‘children’ is ‘child’, of ‘quickest’ is ‘quick’, of ‘processing’ is ‘process’.
3.
A category to which a word is assigned in accordance with its syntactic function, such as verb, noun and others depending on language. In this study we use POS abbreviations from the Penn Treebank tag set (Marcus et al. 1993).
4.
http://greententacle.techfak.uni-bielefeld.de/cunger/qald/.
5.
http://cogcomp.cs.illinois.edu/Data/QA/QC/.
6.
http://wordnetweb.princeton.edu/perl/webwn?o0=1&o8=1&o1=1&s=long&i=10#c.
7.
http://webscope.sandbox.yahoo.com/catalog.php?datatype=l – L6.
8.
http://www.cs.utexas.edu/users/ml/nldata/geoquery.html.

References

Alexopoulos, P., Walker, A., Gomez-Perez, J.M., Wallace, M.: Towards ontology-based question answering in vague domains. In: 2014 9th International Workshop on Semantic and Social Media Adaptation and Personalization (SMAP), pp. 26–31. IEEE (2014)
Google Scholar
Berners-Lee, T., Hendler, J., Lassila, O., et al.: The semantic web. Sci. Am. 284(5), 28–37 (2001)
Article Google Scholar
Bernstein, A., Kaufmann, E., Göhring, A., Kiefer, C.: Querying ontologies: a controlled english interface for end-users. In: Gil, Y., Motta, E., Benjamins, V.R., Musen, M.A. (eds.) ISWC 2005. LNCS, vol. 3729, pp. 112–126. Springer, Heidelberg (2005)
Chapter Google Scholar
Brill, E., Lin, J., Banko, M., Dumais, S., Ng, A., et al.: Data-intensive question answering. In: Proceedings of the Tenth Text REtrieval Conference (TREC 2001) (2001)
Google Scholar
Brin, S., Page, L.: The anatomy of a large-scale hypertextual web search engine. Comput. Netwo. ISDN Syst. 30(1), 107–117 (1998)
Article Google Scholar
Buscaldi, D., Rosso, P.: Mining knowledge from wikipedia for the question answering task. In: Proceedings of the International Conference on Language Resources and Evaluation (2006)
Google Scholar
Cer, D.M., De Marneffe, M.-C., Jurafsky, D., Manning, C.D.: Parsing to stanford dependencies: trade-offs between speed and accuracy. In: LREC (2010)
Google Scholar
Codd, E.F.: A relational model of data for large shared data banks. Commun. ACM 13(6), 377–387 (1970)
Article MATH Google Scholar
Damljanovic, D., Agatonovic, M., Cunningham, H.: Identification of the question focus: combining syntactic analysis and ontology-based lookup through the user interaction. In: 7th Language Resources and Evaluation Conference (LREC), ELRA, La Valletta, Malta. Citeseer (2010)
Google Scholar
Damljanovic, D., Agatonovic, M., Cunningham, H.: FREyA: an interactive way of querying linked data using natural language. In: García-Castro, R., Fensel, D., Antoniou, G. (eds.) ESWC 2011. LNCS, vol. 7117, pp. 125–138. Springer, Heidelberg (2012)
Chapter Google Scholar
De Marneffe, M.-C., MacCartney, B., Manning, C.D.: Generating typed dependency parses from phrase structure parses. In: Proceedings of LREC, vol. 6, pp. 449–454 (2006)
Google Scholar
Dijkstra, E.W.: A note on two problems in connexion with graphs. Nume. Math. 1(1), 269–271 (1959)
Article MathSciNet MATH Google Scholar
Green Jr., B.F., Wolf, A.K., Chomsky, C., Laughery, K.: Baseball: an automatic question-answerer. In: 1961 Western Joint IRE-AIEE-ACM Computer Conference Papers Presented at the May 9–11, pp. 219–224. ACM (1961)
Google Scholar
Krishnan, V., Das, S., Chakrabarti, S.: Enhanced answer type inference from questions using sequential models. In: Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing, pp. 315–322. Association for Computational Linguistics (2005)
Google Scholar
Li, X., Roth, D.: Learning question classifiers. In: Proceedings of the 19th International Conference on Computational Linguistics, vol. 1, pp. 1–7. Association for Computational Linguistics (2002)
Google Scholar
Marcus, M.P., Marcinkiewicz, M.A., Santorini, B.: Building a large annotated corpus of english: the penn treebank. Comput. Linguist. 19(2), 313–330 (1993). ISSN 0891–2017, URL http://dl.acm.org/citation.cfm?id=972470.972475
Google Scholar
McKeown, K.R.: Paraphrasing questions using given and new information. Comput. Linguist. 9(1), 1–10 (1983)
MathSciNet Google Scholar
Miller, G.A.: Wordnet: a lexical database for english. Commun. ACM 38(11), 39–41 (1995)
Article Google Scholar
Moldovan, D., Harabagiu, S., Pasca, M., Mihalcea, R., Girju, R., Goodrum, R., Rus, V.: The structure and performance of an open-domain question answering system. In: Proceedings of the 38th Annual Meeting on Association for Computational Linguistics, pp. 563–570. Association for Computational Linguistics (2000)
Google Scholar
Prager, J., Chu-Carroll, J., Czuba, K.: Statistical answer-type identification in open-domain question answering. In: Proceedings of the Second International Conference on Human Language Technology Research, pp. 150–156. Morgan Kaufmann Publishers Inc. (2002)
Google Scholar
Siddharthan, A.: Syntactic simplification and text cohesion. Res. Lang. Comput. 4(1), 77–109 (2006)
Article Google Scholar
Simmons, R.F.: Answering english questions by computer: a survey. Commun. ACM 8(1), 53–70 (1965). ISSN 0001-0782, doi:10.1145/363707.363732, URL http://doi.acm.org/10.1145/363707.363732
Article Google Scholar
Ullmann, J.R.: An algorithm for subgraph isomorphism. J. ACM (JACM) 23(1), 31–42 (1976)
Article MathSciNet Google Scholar
Wales, J., Sanger, L.: Wikipedia, the free encyclopedia (2001). Accessed April 22, 2013, URL http://en.wikipedia.org/w/index.php?title=Wikipedia&oldid=551616049
Walker, A., Starkey, A., Pan, J.Z., Siddharthan, A.: Making test corpora for question answering more representative. In: Kanoulas, E., Lupu, M., Clough, P., Sanderson, M., Hall, M., Hanbury, A., Toms, E. (eds.) CLEF 2014. LNCS, vol. 8685, pp. 1–6. Springer, Heidelberg (2014)
Google Scholar
Waltz, D.L.: An english language question answering system for a large relational database. Commun. ACM 21(7), 526–539 (1978)
Article MATH Google Scholar
Woods, W.A.: Progress in natural language understanding: an application to lunar geology. In: Proceedings of the June 4–8, National Computer Conference and Exposition, AFIPS 1973, pp. 441–450. ACM, New York (1973). doi:10.1145/1499586.1499695, URL http://doi.acm.org/10.1145/1499586.1499695
Woods, W.A.: Lunar rocks in natural english. Linguist. Struct. Process. 5, 521–569 (1977)
Google Scholar

Download references

Acknowledgement

This research has been partly funded by the European Commission within the 7th Framework Programme/Marie Curie Industry-Academia Partnerships and Pathways schema/PEOPLE Work Programme 2011 project K-Drive number 286348 (cf. http://www.kdrive-project.eu).

Author information

Authors and Affiliations

University of Aberdeen, Aberdeen, UK
Andrew D. Walker, Andrew Starkey, Jeff Z. Pan & Advaith Siddharthan
Expert System, Amsterdam, Netherlands
Panos Alexopoulos & José Manuel Gómez-Pérez

Authors

Andrew D. Walker
View author publications
You can also search for this author in PubMed Google Scholar
Panos Alexopoulos
View author publications
You can also search for this author in PubMed Google Scholar
Andrew Starkey
View author publications
You can also search for this author in PubMed Google Scholar
Jeff Z. Pan
View author publications
You can also search for this author in PubMed Google Scholar
José Manuel Gómez-Pérez
View author publications
You can also search for this author in PubMed Google Scholar
Advaith Siddharthan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Andrew D. Walker .

Editor information

Editors and Affiliations

Southeast University, Nanjing, China
Guilin Qi
Osaka University, Ibaraki, Japan
Kouji Kozaki
The University of Aberdeen, Aberdeen, United Kingdom
Jeff Z. Pan
Zhongnan Hospital of Wuhan University, Wuhan, China
Siwei Yu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Walker, A.D., Alexopoulos, P., Starkey, A., Pan, J.Z., Gómez-Pérez, J.M., Siddharthan, A. (2016). Answer Type Identification for Question Answering. In: Qi, G., Kozaki, K., Pan, J., Yu, S. (eds) Semantic Technology. JIST 2015. Lecture Notes in Computer Science(), vol 9544. Springer, Cham. https://doi.org/10.1007/978-3-319-31676-5_17

Download citation

DOI: https://doi.org/10.1007/978-3-319-31676-5_17
Published: 20 March 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-31675-8
Online ISBN: 978-3-319-31676-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics