2017 14th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technolo 2017
DOI: 10.1109/ecticon.2017.8096371
|View full text |Cite
|
Sign up to set email alerts
|

Stop words in review summarization using TextRank

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 12 publications
(6 citation statements)
references
References 7 publications
0
4
0
Order By: Relevance
“…Similarity scores as stores in the similarity matrix. This approach models the similarity matrix into graphs, where nodes of the graph represent the sentences present in the documents and the edges represent the semantic relation through which the sentences are connected (Manalu, 2017). The similarity between the nodes is equivalent to the weighted edges of the graph (Balcerzak et al, 2014).…”
Section: Proposed Methodologymentioning
confidence: 99%
See 2 more Smart Citations
“…Similarity scores as stores in the similarity matrix. This approach models the similarity matrix into graphs, where nodes of the graph represent the sentences present in the documents and the edges represent the semantic relation through which the sentences are connected (Manalu, 2017). The similarity between the nodes is equivalent to the weighted edges of the graph (Balcerzak et al, 2014).…”
Section: Proposed Methodologymentioning
confidence: 99%
“…TextRank uses certain ways to calculate the relation between sentences, cosine similarity is one of them as described in (Barrios, Lopez, Argerich, & Wachenchauzer, 2016). The meaningless words generally called stop words need to be removed for better summary production as in (Manalu, 2017), (Qaiser & Ali, 2018). TextRank also helps in determining the review assessment and credibility assessment as in (Manalu & Sundjaja, 2017) and (Balcerzak, Jaworski, & Wierzbicki, 2014) respectively.…”
Section: Related Studymentioning
confidence: 99%
See 1 more Smart Citation
“…Tokenization-splitting the text into smaller units, called tokens, such as words or phrases [34]; 3. Removing stop-words-common words that lack significant meaning [35], such as "the" or "and", were eliminated to reduce the text size and improve the performance of the NLP model; 4. Stemming and lemmatization-leveraging the Sastrawi library, stemming for the Indonesian language (Bahasa) was performed to reduce words to their base form [36], known as the stem or lemma; 5.…”
Section: Pre-processingmentioning
confidence: 99%
“…Removing stop-words-stop-words are common words that do not provide significant meaning [33], such as "the" or "and", and can be removed from the text to reduce its size and improve the performance of the NLP model.…”
mentioning
confidence: 99%