2008
DOI: 10.1007/978-3-540-88636-5_11

Effect of Preprocessing on Extractive Summarization with Maximal Frequent Sequences

Citations: cited by 8 publications (7 citation statements)
References: 12 publications
“…Even the combination of AR, WSD and TE could not reach it. It can be concluded that for the TS systems based on the unigrams as opposed to the multiword descriptions [16] stopwords filtering is essential. The best result in this range of settings is 0.40629.…”
Section: Results (citation type: mentioning)
confidence: 99%
“…However, not all the extractive TS approaches equally benefit from the stopwords filtering. Ledeneva et al. [16] have shown that removing the stopwords yields worse results for TS systems based on the multiword descriptions.…”
Section: Stopwords Filtering (citation type: mentioning)
confidence: 99%
“…Case folding is the step that changes the font and converts all letters to lowercase [11]. Stopword removal is the text preprocessing step that removes stopwords from a text [12]. Examples of stopwords in Indonesian are "yang", "dan", "di", and so on.…”
Section: B Text Mining (citation type: unclassified)
“…Several NLP applications have included work on preprocessing; one example is Ledeneva [27], which analyzes the importance of preprocessing in automatic summary generation using maximal frequent sequences. The preprocessing techniques used were lexical analysis, such as punctuation removal and number normalization, and some variants of stopword removal and stemming.…”
Section: State of the Art (citation type: unclassified)
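
The preprocessing steps named in the citation statements above (case folding, punctuation removal, number normalization, stopword filtering) can be sketched in Python as follows. This is only a minimal illustration, not the procedure used in the cited paper or in any of the citing papers: the `preprocess` function, the `NUM` placeholder, and the tiny English stopword list are all assumptions made for the example.

```python
import re
import string

# Hypothetical, deliberately tiny stopword list (illustration only).
STOPWORDS = {"the", "a", "of", "and", "in", "is"}

# Translation table that turns every ASCII punctuation mark into a space.
_PUNCT_TO_SPACE = str.maketrans(string.punctuation, " " * len(string.punctuation))


def preprocess(text, remove_stopwords=True):
    """Return a list of tokens after simple lexical preprocessing."""
    # Case folding: convert all letters to lowercase.
    text = text.lower()
    # Number normalization: replace each number with a placeholder token.
    # "NUM" is an assumed placeholder, chosen so it survives punctuation removal.
    text = re.sub(r"\d+(?:[.,]\d+)*", " NUM ", text)
    # Punctuation removal: replace punctuation with spaces, then split on whitespace.
    tokens = text.translate(_PUNCT_TO_SPACE).split()
    # Optional stopword filtering.
    if remove_stopwords:
        tokens = [t for t in tokens if t not in STOPWORDS]
    return tokens


if __name__ == "__main__":
    sentence = "The cost of 3 Apples is 4.50 dollars, and rising!"
    print(preprocess(sentence))
    print(preprocess(sentence, remove_stopwords=False))
```

Keeping stopword filtering behind a flag makes it easy to compare both settings, which matters here because the statements above report that filtering helps unigram-based summarizers but hurts those based on multiword descriptions.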