2011
DOI: 10.1017/s1351324911000167
|View full text |Cite
|
Sign up to set email alerts
|

Query-focused multi-document summarization: automatic data annotations and supervised learning approaches

Abstract: In this paper, we apply different supervised learning techniques to build query-focused multi-document summarization systems, where the task is to produce automatic summaries in response to a given query or specific information request stated by the user. A huge amount of labeled data is a prerequisite for supervised training. It is expensive and time-consuming when humans perform the labeling task manually. Automatic labeling can be a good remedy to this problem. We employ five different automatic annotation … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
22
0

Year Published

2013
2013
2023
2023

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 27 publications
(22 citation statements)
references
References 29 publications
0
22
0
Order By: Relevance
“…(ii ) solving an optimization problem [20,8,27]: these approaches cast the summarization problem as an optimization problem where an objective function needs to be optimized with respect to some constraints. (iii ) supervised models [60,25,18], where selection of sentences in the summary are learned using a supervised framework. (iv ) graph based [29,53,62]: these approaches seek to find the most central sentences in a document's graph where sentences are nodes and edges are similarities.…”
Section: Text Summarizationmentioning
confidence: 99%
“…(ii ) solving an optimization problem [20,8,27]: these approaches cast the summarization problem as an optimization problem where an objective function needs to be optimized with respect to some constraints. (iii ) supervised models [60,25,18], where selection of sentences in the summary are learned using a supervised framework. (iv ) graph based [29,53,62]: these approaches seek to find the most central sentences in a document's graph where sentences are nodes and edges are similarities.…”
Section: Text Summarizationmentioning
confidence: 99%
“…Up to now, various extraction-based techniques have been proposed for generic document summarization. 28 In automatic document summarization, the selection process of the distinct ideas included in the document is called diversity. The diversity is very important evidence serving to control the redundancy in the summarized text and produce more appropriate summary.…”
Section: Related Workmentioning
confidence: 99%
“…We use query-focused supervised extractive multi-document summarization technique for this purpose [1][2][3]. Ensemble methods are learning algorithms that construct a set of classifiers and then classify new data points by taking a (weighted) vote of their predictions [4].…”
Section: Introductionmentioning
confidence: 99%
“…Supervised classifiers are typically trained on data pairs, defined by feature vectors and corresponding class labels. We use an automatic labeling approach to annotate the training data using ROUGE [1,3,9]. From each sentence of the training (and testing) data, we extract different query-related features and importance-oriented features such as: n-gram overlap, Longest Common Subsequence (LCS), Weighted LCS (WLCS), skip-bigram, exact word overlap, synonym overlap, hypernym/hyponym overlap, gloss overlap, Basic Element (BE) overlap, syntactic tree similarity measure, position of sentences, length of sentences, Named Entity (NE) match, cue word match and title match [1,3,5,13].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation