Parallel Fuzzy c-Means Clustering for Large Data Sets

Kwok, Terence; Smith, Kate; Lozano, Sebastián; Taniar, David

doi:10.1007/3-540-45706-2_48

Cited by 94 publications

(43 citation statements)

References 10 publications

(19 reference statements)


“…FCM finds hyper-spherical clusters and partitions lying in Another approach involves collaborative clustering, where the algorithm runs in every data site and iteratively shares information across sites trying to find a global structure [43,47]. Despite the existence of many algorithms for parallel and distributed data, only a few of them have been developed for fuzzy clustering as a generalization of the original (centralized) ones, e.g., see [48][49][50]. In other words, just a few fuzzy clustering algorithms have been generalized to deal with parallel and distributed data in an exact way.…”

Section: Subsets (Clusters) | mentioning

confidence: 99%


Fuzzy Clustering Algorithms and Validity Indices for Distributed Data

Vendramin

Naldi

Campello

2014

Partitional Clustering Algorithms


This chapter presents a unified framework to generalize a number of fuzzy clustering algorithms to handle distributed data in an exact way, i.e., with no approximation of results with respect to their original centralized versions. The same framework allows the exact distribution of relative validity indices used to evaluate the quality of fuzzy clustering solutions. Complexity analyses for each distributed algorithm and index are reported in terms of space, time, and communication aspects. A general procedure to estimate the number of clusters in a non-centralized fashion using the proposed framework is also described. Such a procedure is directly applicable not only to distributed data, but to parallel data processing scenarios as well. Experimental results illustrate the speedup obtained when running algorithms under the proposed framework in multiple cores of a processor, when compared to their traditional, centralized counterparts running in a single core. Additionally, the quality of the results and amount of data transmitted are assessed and compared among different fuzzy clustering algorithms.
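As a brief illustration of why an exact (approximation-free) distribution of fuzzy c-means is plausible, consider the standard FCM centroid update. The notation below is an assumption introduced here, not taken from the chapter: the data are split across P sites with index sets S_1, ..., S_P, u_{ij} is the membership of point x_j in cluster i, and m > 1 is the fuzzifier.

```latex
% Standard FCM centroid update over all N points
v_i = \frac{\sum_{j=1}^{N} u_{ij}^{m}\, x_j}{\sum_{j=1}^{N} u_{ij}^{m}}

% The same update with the sums regrouped by site (S_1, ..., S_P partition the data)
v_i = \frac{\sum_{p=1}^{P} \sum_{j \in S_p} u_{ij}^{m}\, x_j}
           {\sum_{p=1}^{P} \sum_{j \in S_p} u_{ij}^{m}}

% Standard FCM membership update: depends only on x_j and the current centroids
u_{ij} = \left( \sum_{k=1}^{c}
         \left( \frac{\lVert x_j - v_i \rVert}{\lVert x_j - v_k \rVert} \right)^{\frac{2}{m-1}}
         \right)^{-1}
```

Because each membership depends only on its own point and the shared centroids, every site can compute its inner sums locally and transmit only those sums. Combining them reproduces the centralized update, which is consistent with the "exact" claim in the abstract, although the chapter's own formulation may differ in detail.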



“…The algorithm called here Distributed Fuzzy c-Means (DFCM) is based on the ideas on the parallelization of computations in the FCM algorithm [48][49][50] and is a formal generalization of FCM to handle distributed data. Note that Algorithm 1 (FCM in Sect.…”

Section: DFCM: Distributed Fuzzy c-Means | mentioning

confidence: 99%

Fuzzy Clustering Algorithms and Validity Indices for Distributed Data

Vendramin

Naldi

Campello

2014

Partitional Clustering Algorithms
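The idea behind a distributed FCM iteration, as described in the citation statement above, can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the DFCM algorithm as published: the function names (`memberships`, `local_sums`, `distributed_step`, `centralized_step`), the three-site split, and the synthetic data are all hypothetical. Each site computes partial sums from its own data, and a coordinator combines them into new centroids.

```python
import numpy as np

def memberships(X, V, m=2.0):
    """Standard FCM membership update for points X (n, d) and centroids V (c, d)."""
    # Squared Euclidean distances, shape (n, c); the floor avoids division by zero.
    d2 = np.maximum(((X[:, None, :] - V[None, :, :]) ** 2).sum(axis=2), 1e-12)
    inv = d2 ** (-1.0 / (m - 1.0))
    return inv / inv.sum(axis=1, keepdims=True)

def local_sums(X_site, V, m=2.0):
    """Partial sums one site would report (hypothetical helper)."""
    W = memberships(X_site, V, m) ** m       # fuzzified memberships, shape (n_site, c)
    return W.T @ X_site, W.sum(axis=0)       # numerator (c, d), denominator (c,)

def distributed_step(sites, V, m=2.0):
    """One centroid update: combine the per-site sums at a coordinator."""
    parts = [local_sums(X, V, m) for X in sites]
    num = sum(p[0] for p in parts)
    den = sum(p[1] for p in parts)
    return num / den[:, None]

def centralized_step(X, V, m=2.0):
    """The same update computed on the pooled data, for comparison."""
    num, den = local_sums(X, V, m)
    return num / den[:, None]

rng = np.random.default_rng(0)
X = rng.normal(size=(600, 2))                # synthetic data, assumed for illustration
sites = np.array_split(X, 3)                 # pretend the data live at three sites
V0 = rng.normal(size=(3, 2))                 # arbitrary initial centroids

V_dist = distributed_step(sites, V0)
V_cent = centralized_step(X, V0)
print(np.allclose(V_dist, V_cent))
```

The final check should report that the distributed and centralized updates agree up to floating-point rounding. Only the per-site sums (a c-by-d matrix and a length-c vector) are exchanged in this sketch, never the raw data.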


“…The abstraction tree bears some resemblance to the major familiar quad tree data structure [17] used in the several image processing and image analysis algorithms. Clustering is the process of grouping a data set in a way that the similarity between data within a cluster is maximized while the similarity between data of different clusters is [minimized] [18] and is used for pattern recognition in image processing. To recognize a given pattern in an image various techniques have been utilized, but in general two broad categories of classifications have been made: unsupervised techniques and supervised techniques.…”

Section: Introduction | mentioning

confidence: 99%

An Improved Implementation of Brain Tumor Detection Using Segmentation Based on Neuro Fuzzy Technique

Murugavalli

Rajamani

2007

J. of Computer Science


An implementation of a neuro-fuzzy segmentation process for MRI data is presented in this study to detect various tissues such as white matter, gray matter, CSF, and tumor. The advantages of the hierarchical self-organizing map (HSOM) and fuzzy c-means (FCM) algorithms are used to classify the image layer by layer. The lowest-level weight vector is achieved by the abstraction level. We have also achieved a higher value of tumor pixels with this neuro-fuzzy approach. The computation speed of the proposed method is also studied. The multilayer segmentation results of the neuro-fuzzy technique are shown to have interesting consequences from the viewpoint of clinical diagnosis. The neuro-fuzzy technique shows that MRI brain tumor segmentation using HSOM-FCM is also more accurate.


“…More specifically, some were developed as generalizations of centralized versions of a specific algorithm (Olson, 1995; Dhillon & Modha, 2000; Forman & Zhang, 2000; Garg et al., 2006), and are able to produce the same final results that the respective original algorithms would obtain if they could be applied to the data in a centralized manner. Although many algorithms can handle parallel and distributed data, few have been developed for fuzzy clustering as generalizations of centralized versions of a given algorithm (Kwok et al., 2002; Rahimi et al., 2004; Modenesi et al., 2007). In other words, few fuzzy clustering algorithms have been generalized to work with parallel and distributed data in a way that produces the same final results that the centralized version of the algorithm would obtain with centralized data.…”

Section: Generalization of the Studied Algorithms and Indices | unclassified

“…The algorithm called DFCM (Distributed Fuzzy c-Means) was originally proposed in the parallel context (Kwok et al., 2002; Rahimi et al., 2004; Modenesi et al., 2007). This algorithm is a generalization of the FCM algorithm (Section 2.2.1) to handle parallel or distributed data.…”

Section: DFCM: Distributed Fuzzy c-Means | unclassified

Estudo e desenvolvimento de algoritmos para agrupamento fuzzy de dados em cenários centralizados e distribuídos

Vendramin
