Abstract
Question Answering research has long recognised that identifying the type of answer being requested is a fundamental step in interpreting a question as a whole. Previous strategies have ranged from trivial keyword matches, to statistical analyses, to well-defined algorithms based on shallow syntactic parses with user interaction for ambiguity resolution. A novel strategy combining deep NLP on both syntactic and dependency parses with supervised learning is introduced, and results that improve on extant alternatives are reported. The impact of the strategy on QALD is also evaluated with a proprietary Question Answering system, and its positive results are analysed.
Notes
- 1. A pre-pre-terminal is a node for which every child is a pre-terminal. A pre-terminal is a node with a single child which is itself a leaf.
- 2. A lemma is a word’s root form, without any morphological indications of tense, number, mood, etc. For example, the lemma of ‘children’ is ‘child’, of ‘quickest’ is ‘quick’, and of ‘processing’ is ‘process’.
- 3. A part of speech (POS) is a category to which a word is assigned in accordance with its syntactic function, such as verb or noun, with the set of categories depending on the language. In this study we use POS abbreviations from the Penn Treebank tag set (Marcus et al. 1993).
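The pre-terminal and pre-pre-terminal definitions in note 1 can be illustrated with a minimal sketch. The `Node` class, the function names and the toy parse tree below are hypothetical and are not taken from the paper. The sketch also assumes that a pre-pre-terminal must have at least one child, so that a leaf does not satisfy "every child is a pre-terminal" vacuously.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Node:
    """Hypothetical parse-tree node: a label (phrase tag, POS tag or word) plus children."""
    label: str
    children: List["Node"] = field(default_factory=list)


def is_leaf(node: Node) -> bool:
    """A leaf has no children (in a syntactic parse, a word)."""
    return not node.children


def is_pre_terminal(node: Node) -> bool:
    """A pre-terminal has a single child which is itself a leaf."""
    return len(node.children) == 1 and is_leaf(node.children[0])


def is_pre_pre_terminal(node: Node) -> bool:
    """A pre-pre-terminal is a node for which every child is a pre-terminal.

    Assumption: at least one child is required, so leaves are excluded.
    """
    return bool(node.children) and all(is_pre_terminal(c) for c in node.children)


def pre_pre_terminals(node: Node) -> List[Node]:
    """Collect all pre-pre-terminals in the tree, in depth-first pre-order."""
    found = [node] if is_pre_pre_terminal(node) else []
    for child in node.children:
        found.extend(pre_pre_terminals(child))
    return found


# Toy tree for "the river flows": (S (NP (DT the) (NN river)) (VP (VBZ flows)))
tree = Node("S", [
    Node("NP", [Node("DT", [Node("the")]), Node("NN", [Node("river")])]),
    Node("VP", [Node("VBZ", [Node("flows")])]),
])

print([n.label for n in pre_pre_terminals(tree)])  # ['NP', 'VP']
```

In this toy tree, `NP` and `VP` qualify because all of their children are POS-tagged words, whereas `S` does not because its children are phrases.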
References
Alexopoulos, P., Walker, A., Gomez-Perez, J.M., Wallace, M.: Towards ontology-based question answering in vague domains. In: 2014 9th International Workshop on Semantic and Social Media Adaptation and Personalization (SMAP), pp. 26–31. IEEE (2014)
Berners-Lee, T., Hendler, J., Lassila, O., et al.: The semantic web. Sci. Am. 284(5), 28–37 (2001)
Bernstein, A., Kaufmann, E., Göhring, A., Kiefer, C.: Querying ontologies: a controlled English interface for end-users. In: Gil, Y., Motta, E., Benjamins, V.R., Musen, M.A. (eds.) ISWC 2005. LNCS, vol. 3729, pp. 112–126. Springer, Heidelberg (2005)
Brill, E., Lin, J., Banko, M., Dumais, S., Ng, A., et al.: Data-intensive question answering. In: Proceedings of the Tenth Text REtrieval Conference (TREC 2001) (2001)
Brin, S., Page, L.: The anatomy of a large-scale hypertextual web search engine. Comput. Netw. ISDN Syst. 30(1), 107–117 (1998)
Buscaldi, D., Rosso, P.: Mining knowledge from Wikipedia for the question answering task. In: Proceedings of the International Conference on Language Resources and Evaluation (2006)
Cer, D.M., De Marneffe, M.-C., Jurafsky, D., Manning, C.D.: Parsing to Stanford dependencies: trade-offs between speed and accuracy. In: LREC (2010)
Codd, E.F.: A relational model of data for large shared data banks. Commun. ACM 13(6), 377–387 (1970)
Damljanovic, D., Agatonovic, M., Cunningham, H.: Identification of the question focus: combining syntactic analysis and ontology-based lookup through the user interaction. In: 7th Language Resources and Evaluation Conference (LREC), ELRA, La Valletta, Malta. Citeseer (2010)
Damljanovic, D., Agatonovic, M., Cunningham, H.: FREyA: an interactive way of querying linked data using natural language. In: García-Castro, R., Fensel, D., Antoniou, G. (eds.) ESWC 2011. LNCS, vol. 7117, pp. 125–138. Springer, Heidelberg (2012)
De Marneffe, M.-C., MacCartney, B., Manning, C.D.: Generating typed dependency parses from phrase structure parses. In: Proceedings of LREC, vol. 6, pp. 449–454 (2006)
Dijkstra, E.W.: A note on two problems in connexion with graphs. Numer. Math. 1(1), 269–271 (1959)
Green Jr., B.F., Wolf, A.K., Chomsky, C., Laughery, K.: Baseball: an automatic question-answerer. In: Papers Presented at the May 9–11, 1961, Western Joint IRE-AIEE-ACM Computer Conference, pp. 219–224. ACM (1961)
Krishnan, V., Das, S., Chakrabarti, S.: Enhanced answer type inference from questions using sequential models. In: Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing, pp. 315–322. Association for Computational Linguistics (2005)
Li, X., Roth, D.: Learning question classifiers. In: Proceedings of the 19th International Conference on Computational Linguistics, vol. 1, pp. 1–7. Association for Computational Linguistics (2002)
Marcus, M.P., Marcinkiewicz, M.A., Santorini, B.: Building a large annotated corpus of English: the Penn Treebank. Comput. Linguist. 19(2), 313–330 (1993). ISSN 0891-2017, URL http://dl.acm.org/citation.cfm?id=972470.972475
McKeown, K.R.: Paraphrasing questions using given and new information. Comput. Linguist. 9(1), 1–10 (1983)
Miller, G.A.: WordNet: a lexical database for English. Commun. ACM 38(11), 39–41 (1995)
Moldovan, D., Harabagiu, S., Pasca, M., Mihalcea, R., Girju, R., Goodrum, R., Rus, V.: The structure and performance of an open-domain question answering system. In: Proceedings of the 38th Annual Meeting on Association for Computational Linguistics, pp. 563–570. Association for Computational Linguistics (2000)
Prager, J., Chu-Carroll, J., Czuba, K.: Statistical answer-type identification in open-domain question answering. In: Proceedings of the Second International Conference on Human Language Technology Research, pp. 150–156. Morgan Kaufmann Publishers Inc. (2002)
Siddharthan, A.: Syntactic simplification and text cohesion. Res. Lang. Comput. 4(1), 77–109 (2006)
Simmons, R.F.: Answering English questions by computer: a survey. Commun. ACM 8(1), 53–70 (1965). ISSN 0001-0782, doi:10.1145/363707.363732, URL http://doi.acm.org/10.1145/363707.363732
Ullmann, J.R.: An algorithm for subgraph isomorphism. J. ACM (JACM) 23(1), 31–42 (1976)
Wales, J., Sanger, L.: Wikipedia, the free encyclopedia (2001). Accessed April 22, 2013, URL http://en.wikipedia.org/w/index.php?title=Wikipedia&oldid=551616049
Walker, A., Starkey, A., Pan, J.Z., Siddharthan, A.: Making test corpora for question answering more representative. In: Kanoulas, E., Lupu, M., Clough, P., Sanderson, M., Hall, M., Hanbury, A., Toms, E. (eds.) CLEF 2014. LNCS, vol. 8685, pp. 1–6. Springer, Heidelberg (2014)
Waltz, D.L.: An English language question answering system for a large relational database. Commun. ACM 21(7), 526–539 (1978)
Woods, W.A.: Progress in natural language understanding: an application to lunar geology. In: Proceedings of the June 4–8, 1973, National Computer Conference and Exposition, AFIPS 1973, pp. 441–450. ACM, New York (1973). doi:10.1145/1499586.1499695, URL http://doi.acm.org/10.1145/1499586.1499695
Woods, W.A.: Lunar rocks in natural English. Linguist. Struct. Process. 5, 521–569 (1977)
Acknowledgement
This research has been partly funded by the European Commission within the 7th Framework Programme/Marie Curie Industry-Academia Partnerships and Pathways schema/PEOPLE Work Programme 2011 project K-Drive number 286348 (cf. http://www.kdrive-project.eu).
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Walker, A.D., Alexopoulos, P., Starkey, A., Pan, J.Z., Gómez-Pérez, J.M., Siddharthan, A. (2016). Answer Type Identification for Question Answering. In: Qi, G., Kozaki, K., Pan, J., Yu, S. (eds) Semantic Technology. JIST 2015. Lecture Notes in Computer Science, vol 9544. Springer, Cham. https://doi.org/10.1007/978-3-319-31676-5_17
DOI: https://doi.org/10.1007/978-3-319-31676-5_17
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-31675-8
Online ISBN: 978-3-319-31676-5
eBook Packages: Computer Science, Computer Science (R0)