2009 First International Conference on Information Science and Engineering 2009
DOI: 10.1109/icise.2009.1163
|View full text |Cite
|
Sign up to set email alerts
|

Text Clustering Based on Key Phrases

Abstract: Text clustering is a hot and essential topic in data mining and information retrieval. This paper proposed a KP-FCM clustering method, which used the key phrases as text features and applied the Fuzzy c-means (FCM) as clustering algorithm. In this method, key phrases were extracted by an algorithm based on suffix array. Experimental results on two standard text clustering benchmark corpuses, OHSUMED (English) and the SOGOU corpus (Chinese) showed that this KP-FCM algorithm outperformed STC-10, Lingo in terms o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
2
0
4

Year Published

2011
2011
2018
2018

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(6 citation statements)
references
References 7 publications
0
2
0
4
Order By: Relevance
“…Misalnya satu set dokumen D dikategorikan menjadi beberapa kelas (3) dan (4). F i merupakan fungsi dari precision dan recall [10].…”
Section: Precision Recall Dan F-measureunclassified
See 3 more Smart Citations
“…Misalnya satu set dokumen D dikategorikan menjadi beberapa kelas (3) dan (4). F i merupakan fungsi dari precision dan recall [10].…”
Section: Precision Recall Dan F-measureunclassified
“…Untuk menghindari overlap seperti ini, cluster-cluster yang topiknya serupa dihubungkan hanya pada satu kelas [10].…”
Section: Precision Recall Dan F-measureunclassified
See 2 more Smart Citations
“…Ontologies can be used to represent documents at a semantic level [ 8 , 9 ], but this concept-based model needs a well-defined database or a gold standard set for mapping words to pre-defined concepts. Key phrase-based approaches were also proposed for text clustering [ 10 , 11 ]. Key phrase extraction constructs a human-friendly feature set.…”
Section: Introductionmentioning
confidence: 99%