2021
DOI: 10.33633/jais.v6i1.4454
|View full text |Cite
|
Sign up to set email alerts
|

Keyphrase Extraction on Covid-19 Tweets Based on Doc2Vec and YAKE

Abstract: Keyword and keyphrase extraction are one of the initial foundations for performing several text processing operations such as summarization and document clustering. YAKE is one of the techniques used for unsupervised and independent keyphrase extraction, it does not require a corpus for linguistic tools such as NER and POS-tag. However, the use of YAKE in microblogging documents such as Twitter often results in a keyphrase that is less representative because of the lack of words used for ranking. This paper of… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 15 publications
0
2
0
Order By: Relevance
“…is not dependent on the subject words, and it may produce less representative candidates. To address this problem, Firdausillah [35] introduced Doc2Vec, which belongs to the cross-document model, and can find similar documents in multiple ones based on the assumption of correlation and then combines Doc2Vec with YAKE! to calculate the similarity of the documents and merge similar ones.…”
Section: Unsupervised Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…is not dependent on the subject words, and it may produce less representative candidates. To address this problem, Firdausillah [35] introduced Doc2Vec, which belongs to the cross-document model, and can find similar documents in multiple ones based on the assumption of correlation and then combines Doc2Vec with YAKE! to calculate the similarity of the documents and merge similar ones.…”
Section: Unsupervised Methodsmentioning
confidence: 99%
“…Based on (7), Y-Rank merges statistical and semantic feature scores and outputs the ranked results in descending order (lines [32][33][34][35].…”
Section: Y-rank Implementationmentioning
confidence: 99%