2013
DOI: 10.1016/j.ins.2012.07.025
|View full text |Cite
|
Sign up to set email alerts
|

Efficient stochastic algorithms for document clustering

Abstract: Clustering has become an increasingly important and highly complicated research area for targeting useful and relevant information in modern application domains such as the World Wide Web. Recent studies have shown that the most commonly used partitioning-based clustering algorithm, the K-means algorithm, is more suitable for large datasets. However, the K-means algorithm may generate a local optimal clustering. In this paper, we present novel document clustering algorithms based on the Harmony Search (HS) opt… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
72
0
9

Year Published

2015
2015
2022
2022

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 110 publications
(82 citation statements)
references
References 39 publications
1
72
0
9
Order By: Relevance
“…The smaller value of ADDC is more compact clustering solution (Forsati et al, 2013). Figure 9 illustrates the quality performance metrics; F-measure, Entropy, Purity and ADDC results between the WFA and WFA II .…”
Section: Results Of Comparison Of Wfa and Wfa IImentioning
confidence: 99%
See 4 more Smart Citations
“…The smaller value of ADDC is more compact clustering solution (Forsati et al, 2013). Figure 9 illustrates the quality performance metrics; F-measure, Entropy, Purity and ADDC results between the WFA and WFA II .…”
Section: Results Of Comparison Of Wfa and Wfa IImentioning
confidence: 99%
“…Term FrequencyInverse Document Frequency (TF-IDF) is a technique that has been widely used to represent documents in the form of numerical weights in the vector space (Manning et al, 2008;Forsati et al, 2013). TF-IDF for each term in a document is equal to the term frequency multiply by the inverse documents frequency, idf, which can be calculated using Equation 5 (Manning et al, 2008):…”
Section: Construction Of a Vector Space Modelmentioning
confidence: 99%
See 3 more Smart Citations