2013
DOI: 10.5430/air.v2n3p35
|View full text |Cite
|
Sign up to set email alerts
|

The role of statistical and semantic features in single-document extractive summarization

Abstract: This paper reports on the further results of the ongoing research analyzing the impact of a range of commonly used statistical and semantic features in the context of extractive text summarization. The features experimented with include word frequency, inverse sentence and term frequencies, stopwords filtering, word senses, resolved anaphora and textual entailment. The obtained results demonstrate the relative importance of each feature and the limitations of the tools available. It has been shown that the inv… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2013
2013
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(4 citation statements)
references
References 14 publications
0
4
0
Order By: Relevance
“…The last two approaches, [37], used anaphora resolution and a Word Sense Disambiguation (WSD) method for enriching the document with semantic knowledge, and extracting the most important sentences based on the number of concepts they contained, instead of terms. The difference between them was the WSD method employed: MFS for the most frequent sense, and UKB for a PageRank-based WSD method [38].…”
Section: Quantitative Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…The last two approaches, [37], used anaphora resolution and a Word Sense Disambiguation (WSD) method for enriching the document with semantic knowledge, and extracting the most important sentences based on the number of concepts they contained, instead of terms. The difference between them was the WSD method employed: MFS for the most frequent sense, and UKB for a PageRank-based WSD method [38].…”
Section: Quantitative Resultsmentioning
confidence: 99%
“…This is the case of COMPENDIUM summarizer [36], which employs textual entailment together with statistical and linguistic-based features for scoring sentences, and determines which ones are more relevant to take part in the summary. Also, the approach proposed in [37] analyzed different combinations of statistical and linguistic settings, such as anaphora resolution together with Word Sense Disambiguation (WSD) methods for extracting the most important sentences based on the number of concepts they contained, instead of terms.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The current difficulty associated to building natural language generation systems [15] and our considerable experience in text summarisation for extracting key ideas [16], [17], [18] has led us to address this study from a summarisation perspective rather than from natural language generation, even though generating natural language and applying it to Social Media (e.g., Twitter) would be our ultimate long-term goal.…”
Section: Related Workmentioning
confidence: 99%