Particle Grey Wolf Optimizer (PGWO) Algorithm and Semantic Word Processing for Automatic Text Clustering

Vidyadhari, Ch.; Sandhya, N.; Premchand, P.

doi:10.1142/s0218488519500090

Cited by 10 publications

(2 citation statements)

References 31 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The accuracy of text clustering is reduced due to several unnecessary words and large dimensions. The semantic word processing and novel Particle Grey Wolf Optimizer (PGWO) for efficient text clustering are introduced in Reference [59]. First, the text documents are provided as input to the initial phase, which offers valuable keyword for clustering and feature extraction.…”

Section: Gray Wolf Optimizer (Gwo)mentioning

confidence: 99%

Advances in Meta-Heuristic Optimization Algorithms in Big Data Text Clustering

et al. 2021

View full text Add to dashboard Cite

This paper presents a comprehensive survey of the meta-heuristic optimization algorithms on the text clustering applications and highlights its main procedures. These Artificial Intelligence (AI) algorithms are recognized as promising swarm intelligence methods due to their successful ability to solve machine learning problems, especially text clustering problems. This paper reviews all of the relevant literature on meta-heuristic-based text clustering applications, including many variants, such as basic, modified, hybridized, and multi-objective methods. As well, the main procedures of text clustering and critical discussions are given. Hence, this review reports its advantages and disadvantages and recommends potential future research paths. The main keywords that have been considered in this paper are text, clustering, meta-heuristic, optimization, and algorithm.

show abstract

Section: Gray Wolf Optimizer (Gwo)mentioning

confidence: 99%

Advances in Meta-Heuristic Optimization Algorithms in Big Data Text Clustering

et al. 2021

View full text Add to dashboard Cite

show abstract

“…Innowadaysdocument clusteringisaveryactiveresearch field, andmany approaches have beenestablishedtodealwithit (DipakandMukesh,2011;OikonomakouandVazirgiannis,2009;Zamir and Etzioni, 1998;Vidyadhari et al, 2019). They are categorized into two major classes: thehierarchicalandthepartitioningbasedclustering.Thedifferencebetweenthesetwocategories ofclusteringmethodsresidesinthepropertiesofthedeliveredclusters.Inthepartitioningbased clustering,thedataaredirectlydividedintoapredefinednumberofdisjointgroups.However,inthe hierarchicalclustering,adendrogramisgeneratedinlevels'sequences,ineachone,apartitioning clusteringisrealizedwithafixednumberofclusters.Itvariesfromsingletonclusterstoonecluster containingallthedata.Itsunsupervisednaturemakesclusteringasoneofthemostdifficultproblems ofdatamining.Furthermore,itisconsideredasanNP-hardproblem (XuandWunsch,2005;Jain etal.,1999;Dubes,1993).Oneshouldnoticethatthetimecomplexityofhierarchicalclustering isquadratic,whereasitisalmostlinearinthepartitioningapproaches.Therefore,thepartitioning approachesaremoresuitableforclusteringlarge-scaledatasets.…”

Section: Introductionmentioning

confidence: 99%

Biomedical Document Clustering Based on Accelerated Symbiotic Organisms Search Algorithm

Boushaki

Bendjeghaba

Kamel

2021

International Journal of Swarm Intelligence Research

View full text Add to dashboard Cite

Clustering is an important unsupervised analysis technique for big data mining. It finds its application in several domains including biomedical documents of the MEDLINE database. Document clustering algorithms based on metaheuristics is an active research area. However, these algorithms suffer from the problems of getting trapped in local optima, need many parameters to adjust, and the documents should be indexed by a high dimensionality matrix using the traditional vector space model. In order to overcome these limitations, in this paper a new documents clustering algorithm (ASOS-LSI) with no parameters is proposed. It is based on the recent symbiotic organisms search metaheuristic (SOS) and enhanced by an acceleration technique. Furthermore, the documents are represented by semantic indexing based on the famous latent semantic indexing (LSI). Conducted experiments on well-known biomedical documents datasets show the significant superiority of ASOS-LSI over five famous algorithms in terms of compactness, f-measure, purity, misclassified documents, entropy, and runtime.

show abstract