2012
DOI: 10.1007/978-3-642-31178-9_7
|View full text |Cite
|
Sign up to set email alerts
|

Extracting Multi-document Summaries with a Double Clustering Approach

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
10
0

Year Published

2012
2012
2017
2017

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 9 publications
(10 citation statements)
references
References 11 publications
0
10
0
Order By: Relevance
“…The goal of this module is to produce summaries from a certain number of texts of the same topic. This is an adaptation of the work presented in [19], with the difference that the module is designed to be language-independent. After a preprocessing stage that uses the NLP module, the process collects all the recognized named entities and keywords of the text, using the wellknown Term Frequency-Inverse Document Frequency (TF-IDF) algorithm [20] for this second task.…”
Section: A Language Unitmentioning
confidence: 98%
“…The goal of this module is to produce summaries from a certain number of texts of the same topic. This is an adaptation of the work presented in [19], with the difference that the module is designed to be language-independent. After a preprocessing stage that uses the NLP module, the process collects all the recognized named entities and keywords of the text, using the wellknown Term Frequency-Inverse Document Frequency (TF-IDF) algorithm [20] for this second task.…”
Section: A Language Unitmentioning
confidence: 98%
“…Due to our interest in working with multi-documents, we have analyzed other similar works, for example [11], that use domainindependent techniques based mainly on fast statistical processing, a metric for reducing redundancy and maximizing diversity in the selected passages or that use a cluster centroid with techniques such as graph matching, maximal marginal relevance, and language generation. Also we find very interesting the recent contributions of SIMBA [12], which has a smart procedure to simplify sentences to ensure the compression of the summary. To carry out this task, SIMBA applies a two-stage process of clusterization: clustering sentences by similarity and clustering sentences by keyword.…”
Section: Automatic Summariesmentioning
confidence: 98%
“…They further extended bimixture PLSA to incorporate the sentence information, and proposed bimixture PLSA with sentence bases (Bi-PLSAS) to simultaneously cluster and summarize the documents utilizing the mutual influence of the document clustering and summarization procedures. Silveira and Branco [2012] proposed a method for extractive multi-document summarization that explores a two-phase clustering approach. Zhang et al [2012] proposed to rank sentences from a document by exploiting the mutual effects between terms, sentences, and clusters.…”
Section: Related Workmentioning
confidence: 99%