2009
DOI: 10.1613/jair.2784
|View full text |Cite
|
Sign up to set email alerts
|

Complex Question Answering: Unsupervised Learning Approaches and Experiments

Abstract: Complex questions that require inferencing and synthesizing information from multiple documents can be seen as a kind of topic-oriented, informative multi-document summarization where the goal is to produce a single text as a compressed version of a set of documents with a minimum loss of relevant information. In this paper, we experiment with one empirical method and two unsupervised statistical machine learning techniques: K-means and Expectation Maximization (EM), for computing relative importance of the se… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
23
0

Year Published

2010
2010
2023
2023

Publication Types

Select...
5
3
2

Relationship

0
10

Authors

Journals

citations
Cited by 26 publications
(23 citation statements)
references
References 23 publications
0
23
0
Order By: Relevance
“…The multi-document summarization problem has received much attention recently (Lyngbaek, 2013;Sood, 2013;Qian and Liu, 2013) due to its ability to reduce large quantities of text to a human processable amount as well as its application in other fields such as question answering (Liu et al, 2008;Chali et al, 2009a;Chali et al, 2009b;Chali et al, 2011b). We expect this trend to further increase as the amount of linguistic data on the web from sources such as social media, wikipedia, and online newswire increases.…”
Section: Introductionmentioning
confidence: 99%
“…The multi-document summarization problem has received much attention recently (Lyngbaek, 2013;Sood, 2013;Qian and Liu, 2013) due to its ability to reduce large quantities of text to a human processable amount as well as its application in other fields such as question answering (Liu et al, 2008;Chali et al, 2009a;Chali et al, 2009b;Chali et al, 2011b). We expect this trend to further increase as the amount of linguistic data on the web from sources such as social media, wikipedia, and online newswire increases.…”
Section: Introductionmentioning
confidence: 99%
“…Many websites that are allowing question answering technology, allow end users to "ask" question to the computer and get exact or related answer [13]. Question and Answering technique utilizes multiple text mining methods.…”
Section: Question Answeringmentioning
confidence: 99%
“…The basic idea behind this method is that the higher the frequency a term occurs in one category than in the other, the more important this term is to this category. The rf weight of a term w j in Ω p can be calculated as: (6) where j w tp is the number of questions that contain w j in Ω p , j w fn denotes the number of questions in which w j appears in Ω u , and ε is a nonnegative constant which is set to 0 in this paper. The rf weight of w j in Ω u can also be calculated similarly.…”
Section: B Popular and Unpopular Termsmentioning
confidence: 99%