2020
DOI: 10.3390/sym12111864
|View full text |Cite
|
Sign up to set email alerts
|

A Review of Unsupervised Keyphrase Extraction Methods Using Within-Collection Resources

Abstract: An essential part of a text generation task is to extract critical information from the text. People usually obtain critical information in the text via manual extraction; however, the asymmetry between the ability to process information manually and the speed of information growth makes it impossible. This problem can be solved by automatic keyphrase extraction. In this paper, the mainstream unsupervised methods to extract keyphrases are summarized, and we analyze in detail the reasons for the differences in … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 21 publications
(10 citation statements)
references
References 32 publications
0
7
0
Order By: Relevance
“…In addition, the use and identification of stopwords and custom_stopwords is essential and there are several alternative approaches to determine them ( Gerlach et al, 2019 ; Sarica and Luo, 2021 ). Similarly, there are additional algorithms for keyword and keyphrase extraction and sentiment analyses not used in this study ( Siddiqi and Sharan, 2015 ; Thelwall, 2017 ; Papagiannopoulou and Tsoumakas, 2020 ; Sun et al, 2020 ). The strategies used here were based on simplicity and the nature of the short text available for analyses in tweets, although we note more complex approaches are available for their analyses ( Edo-Osagie et al, 2020 ).…”
Section: Discussionmentioning
confidence: 99%
“…In addition, the use and identification of stopwords and custom_stopwords is essential and there are several alternative approaches to determine them ( Gerlach et al, 2019 ; Sarica and Luo, 2021 ). Similarly, there are additional algorithms for keyword and keyphrase extraction and sentiment analyses not used in this study ( Siddiqi and Sharan, 2015 ; Thelwall, 2017 ; Papagiannopoulou and Tsoumakas, 2020 ; Sun et al, 2020 ). The strategies used here were based on simplicity and the nature of the short text available for analyses in tweets, although we note more complex approaches are available for their analyses ( Edo-Osagie et al, 2020 ).…”
Section: Discussionmentioning
confidence: 99%
“…e relation between five features can be expressed through equation (1), where S(w) is the measure for each word. After calculating the measure for each word, the final keyword is calculated utilizing a 3-gram model [15]:…”
Section: Unsupervised Keyword Extractionmentioning
confidence: 99%
“…Before the emergence of technology, this information could be processed by humans, which was very time-consuming. Furthermore, due to the inconsistencies between the amount of data and manual data processing skills, it is challenging to complete this vast information, leading to automated keyphrase extraction systems that utilise computers' extensive computational capability to substitute manual labour [2], [3].…”
Section: Introductionmentioning
confidence: 99%