2015
DOI: 10.1016/j.knosys.2014.12.019
|View full text |Cite
|
Sign up to set email alerts
|

Labeling clusters from both linguistic and statistical perspectives: A hybrid approach

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
9
0
2

Year Published

2016
2016
2021
2021

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(11 citation statements)
references
References 15 publications
0
9
0
2
Order By: Relevance
“…Pemilihan frasa kandidat dapat menggunakan pendekatan statistik, berbasis graf, atau klasterisasi topik. Pemilihan frasa dengan pendekatan statistik antara lain dengan pembobotan Term Frequency -Inverse Cluster Frequency (TF-ICF) [27], menggunakan perhitungan Markov Chain [11], dan pemberian nilai frasa kandidat berdasarkan Pointwise Mutual Information (PMI) [28]. Pemilihan frasa kandidat berbasis graf sebagai representasi teks seperti TextRank [23]…”
Section: Pelabelan Klaster Dan Klasterisasi Topikunclassified
See 1 more Smart Citation
“…Pemilihan frasa kandidat dapat menggunakan pendekatan statistik, berbasis graf, atau klasterisasi topik. Pemilihan frasa dengan pendekatan statistik antara lain dengan pembobotan Term Frequency -Inverse Cluster Frequency (TF-ICF) [27], menggunakan perhitungan Markov Chain [11], dan pemberian nilai frasa kandidat berdasarkan Pointwise Mutual Information (PMI) [28]. Pemilihan frasa kandidat berbasis graf sebagai representasi teks seperti TextRank [23]…”
Section: Pelabelan Klaster Dan Klasterisasi Topikunclassified
“…Kata tunggal sebagai label klaster dianggap kurang intuitif sehingga frasa kata diutamakan karena lebih deskriptif bagi representasi topik dengan gabungan pendekatan linguistik serta statistik [11]. Klasterisasi memberikan hasil kurang optimal jika beberapa kelompok dokumen masih memiliki kemiripan kontekstual seperti sinonim, polisemi, atau ambiguitas [12].…”
unclassified
“…Their results labelled clusters with an average above 88.79% of elements correctly. Li et al (2015) developed a combine approach of both linguistic and statistical perspectives to label the clusters. Performance of their approach is evaluated on 20-Newsgroups (English) and NewsMiner (Chinese) datasets.…”
Section: Literature Surveymentioning
confidence: 99%
“…The problem of cluster labeling has been subject to different interesting researches in the literature [12,4,[15][16][17]. These researches have explored different techniques to achieve cluster labeling.…”
Section: Related Workmentioning
confidence: 99%
“…The presented algorithm assigns few labels to the clusters based on the cluster analysis information, the parent cluster and statistics about the corpus. Recently, Li et al [12] proposed an hybrid approach, combining linguistic and statistical techniques to achieve an automated labeling of the clusters. Although these approaches are very interesting, they are all dedicated to textual data and cannot work on quantitative data.…”
Section: Related Workmentioning
confidence: 99%