Why-Type Question to Query Reformulation for Efficient Document Retrieval

Why-Type Question to Query Reformulation for Efficient Document Retrieval

Manvi Breja, Sanjay Kumar Jain
Copyright: © 2022 |Pages: 18
DOI: 10.4018/IJIRR.289948
Article PDF Download
Open access articles are freely available for download

Abstract

Understanding the actual need of user from a question is very crucial in non-factoid why-question answering as Why-questions are complex and involve ambiguity and redundancy in their understanding. The precise requirement is to determine the focus of question and reformulate them accordingly to retrieve expected answers to a question. The paper analyzes different types of why-questions and proposes an algorithm for each class to determine the focus and reformulate it into a query by appending focal terms and cue phrase ‘because’ with it. Further, a user interface is implemented which asks input why-question, applies different components of question , reformulates it and finally retrieve web pages by posing query to Google search engine. To measure the accuracy of the process, user feedback is taken which asks them to assign scoring from 1 to 10, on how relevant are the retrieved web pages according to their understanding. The results depict that maximum precision of 89% is achieved in Informational type why-questions and minimum of 48% in opinionated type why-questions.
Article Preview
Top

Introduction

Question Reformulation is one of the components of Question Analysis module in Question Answering System. Question Reformulation reformulates the input question according to user’s need in order to affect the accuracy of subsequent modules. Why-type non-factoid questions are complex and ambiguous; making them difficult to answer. It is difficult to understand the actual need of user and derive an appropriate non-ambiguous meaning to it. If a correct query is posed to a search engine, it retrieves appropriate web pages that ultimately help in accurate document retrieval. In English language, there are two broad categorizations of questions (1) Factoid questions of type what, where, which, when and who; (2) Non-Factoid questions of type why and how. The factoid questions are simple and non-ambiguous whereas non-factoid questions are complex and difficult to answer.

Question Reformulation plays a crucial role in question answering system. It retransforms question into an appropriate query that depicts the user’s need and thus helps in efficient answer retrieval. The performance of question reformulation affects the performance of subsequent modules, i.e. document, answer candidate extraction and answer re-ranker (Kangavari et al., 2008).

Query reformulation is a key task in today’s web search engines for retrieving accurate and best results corresponding to the users’ query. Query reformulation is a process of modifying original query to resolve problems of ambiguity, vocabulary mismatch and vagueness. There are different techniques to query reformulation viz. (1) query expansion, (2) query suggestion and (3) query refinement (Ooi et al., 2015).

Query expansion expands query based on (a) relevance feedback by finding co-occurring terms, (b) query terms appended by their synonyms retrieved from WordNet and (c) retrieved informative terms for expansion from definition clusters (Bernhard, 2010). Query refinement modifies query based on the users’ past query logs. It doesn’t provide choice to user in selecting terms which can be appended to query. Terms are generated based on user feedback from the top ranked documents irrelevant to its appropriateness which helps in achieving high recall and precision. Finally query suggestion helps understanding the actual information need of user and is found as the most fundamental features of search engines. They are often required in case of rare query being posed, single-term query, unambiguous query suggestions, query suggestions are generalized form of original query and several pages are crawled by user. The approach suggests several other refined query corresponding to original user query based on the users’ interest/search logs analysis so that user can select terms that should be replaced original terms for better document retrieval.

The paper focuses on improving the why-question answering system by reformulating why-questions into an appropriate query that can depict the user’s need and when posed on search engine, help in retrieving appropriate web pages. There are some cases where the actual user need can’t be understood from the question, thus there comes the need for analyzing the question and reformulates it into an appropriate form that can depict the user need from the question.

The organization of paper is described as: Section 2 discusses researches of reformulation. Section 3 puts light on the main focus of the article. Section 4 analyzes different components of question with their impact on reformulation. Section 5 discusses algorithm designed for reformulation of different why-type questions. Section 6 highlights implementation details utilized while designing a user interface for reformulation. Section 7 describes results with their analysis on user feedback. Finally section 8 concludes the work.

Complete Article List

Search this Journal:
Reset
Volume 14: 1 Issue (2024)
Volume 13: 1 Issue (2023)
Volume 12: 4 Issues (2022): 3 Released, 1 Forthcoming
Volume 11: 4 Issues (2021)
Volume 10: 4 Issues (2020)
Volume 9: 4 Issues (2019)
Volume 8: 4 Issues (2018)
Volume 7: 4 Issues (2017)
Volume 6: 4 Issues (2016)
Volume 5: 4 Issues (2015)
Volume 4: 4 Issues (2014)
Volume 3: 4 Issues (2013)
Volume 2: 4 Issues (2012)
Volume 1: 4 Issues (2011)
View Complete Journal Contents Listing