2019
DOI: 10.1109/tkde.2018.2840127
|View full text |Cite
|
Sign up to set email alerts
|

Comments Mining With TF-IDF: The Inherent Bias and Its Removal

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
34
0
6

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 82 publications
(45 citation statements)
references
References 49 publications
0
34
0
6
Order By: Relevance
“…The research on the mining of attribute-opinion word pairs has attracted wide attention, mainly including the following three aspects: (a) The mining of attribute-opinion word pairs is regarded as a task of "keyword" extraction, and these keywords are extracted with unsupervised methods, for example, latent Dirichlet allocation (LDA), 12,13 TextRank, 14,15 and term frequencyinverse document frequency (TF-IDF). 16,17 However, those unsupervised methods have their limitations. That is only words can be accurately extracted rather than research contents in a certain context, and phrases cannot be analyzed.…”
Section: Attribute-opinion Pairs Miningmentioning
confidence: 99%
“…The research on the mining of attribute-opinion word pairs has attracted wide attention, mainly including the following three aspects: (a) The mining of attribute-opinion word pairs is regarded as a task of "keyword" extraction, and these keywords are extracted with unsupervised methods, for example, latent Dirichlet allocation (LDA), 12,13 TextRank, 14,15 and term frequencyinverse document frequency (TF-IDF). 16,17 However, those unsupervised methods have their limitations. That is only words can be accurately extracted rather than research contents in a certain context, and phrases cannot be analyzed.…”
Section: Attribute-opinion Pairs Miningmentioning
confidence: 99%
“…Algoritma TF-IDF digunakan untuk melakukan perhitungan bobot komentar dan mengklasifikasikannya ke dalam 2 kelas (komentar potensial dan komentar tidak potensial). Pembobotan TF-IDF umumnya digunakan dalam penambangan teks dan pencarian informasi untuk mengevaluasi pentingnya istilah linguistik (umumnya unigram atau bigram) dalam korpus yang diteliti [14]. Perhitungan TF-IDF menggunakan persamaan 1.…”
Section: Loginunclassified
“…TF adalah merupakan jumlah kemunculan setiap kata pada setiap dokumen dan IDF merepresentasikan jumlah dokumen yang memiliki kata tertentu berdasarkan jumlah kata dalam teks. Perhitungan TF-IDF dapat dilakukan dengan menggunakan rumus seperti pada persamaan 1 [12].…”
Section: Ekstraksi Fiturunclassified