2004
DOI: 10.1613/jair.1523
|View full text |Cite
|
Sign up to set email alerts
|

LexRank: Graph-based Lexical Centrality as Salience in Text Summarization

Abstract: We introduce a stochastic graph-based method for computing relative importance of textual units for Natural Language Processing. We test the technique on the problem of Text Summarization (TS). Extractive TS relies on the concept of sentence salience to identify the most important sentences in a document or set of documents. Salience is typically defined in terms of the presence of particular important words or in terms of similarity to a centroid pseudo-sentence. We consider a new approach, LexRank, for compu… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

6
1,777
0
28

Year Published

2006
2006
2022
2022

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 2,101 publications
(1,811 citation statements)
references
References 32 publications
6
1,777
0
28
Order By: Relevance
“…Depending on the attributes of the connection relationship, centrality can be interpreted in various ways (e.g., degree, closeness, and betweenness) [58]. Degree centricity refers to the degree of strong connection and attention in the network, which can be useful as the simplest and most effective indicator of the power relationship across nodes [59,60]. Degree centrality is measured to the extent to which a node is connected to another node in the network.…”
Section: Text Mining: Tf-idf and Degree Centralitymentioning
confidence: 99%
“…Depending on the attributes of the connection relationship, centrality can be interpreted in various ways (e.g., degree, closeness, and betweenness) [58]. Degree centricity refers to the degree of strong connection and attention in the network, which can be useful as the simplest and most effective indicator of the power relationship across nodes [59,60]. Degree centrality is measured to the extent to which a node is connected to another node in the network.…”
Section: Text Mining: Tf-idf and Degree Centralitymentioning
confidence: 99%
“…Beside the difference in the approach (using headlines instead of the full text), our framework differs from MDS in that it takes the relations among events across dates into account. As there is already a rich body of research on multi-document summarization (for example, [22], [21], [16], [8], [17]), in this study we also investigate how good they are in producing daily summaries using only headlines, in the same setting as our approaches.…”
Section: Related Workmentioning
confidence: 99%
“…Since ETS and Chieu et al extract sentences from the full text of news articles for timeline summaries, as shown in the experiments of Yan et al [27], we also would like to see how good (multi-)document summarization would work on the headlines dataset. We consider the following state-of-the-art methods: Centroid [22], LexRank [8], TextRank [19] SumSim selects top news reports and non-duplicated headlines that maximize the sum of TF-IDF similarity with other headlines that are published in the previous and next 10 days. Conceptually, it works similarly to Chieu et al, but on the headline level.…”
Section: Systems For Comparisonmentioning
confidence: 99%
“…Different measurements are used to determine how to represent sentence and how to define connections between sentences. The similarity between two sentences according to their term vectors is used to generate links and define link strength in [4]. Similarly, [3] weighed links by the content overlap of two sentences normalized by the length of each sentence.…”
Section: Related Workmentioning
confidence: 99%