Apontamentos sobre pesquisa qualitativa e pesquisa empírico-fenomenológica

Data clustering is an unsupervised learning task that has found many applications in various scientific fields. The goal is to find subgroups of closely related data samples (clusters) in a set of unlabeled data. Kernel k-Means is a state of the art clustering algorithm. However, in contrast to clustering algorithms that can work using only a limited percentage of the data at a time, Kernel k-Means is a global clustering algorithm. It requires the computation of the kernel matrix, which takes O(n 2 d) time and O(n 2 ) space in memory. As datasets grow larger, the application of Kernel k-Means becomes infeasible on a single computer, a fact that strongly suggests a distributed approach. In this paper, we present such an approach to the Kernel k-Means clustering algorithm, in order to make its application to a large number of samples feasible and, thus, achieve high performance clustering results on very big datasets. Our distributed approach follows the MapReduce programming model and consists of 3 stages, the kernel matrix computation, a novel matrix trimming method and the Kernel k-Means clustering algorithm.

show abstract

Shape matching using a binary search tree structure of weak classifiers

Tsapanos

Tefas

Nikolaidis

et al. 2012

Pattern Recognition

View full text Add to dashboard Cite

Graph representations using adjacency matrix transforms for clustering

Tsapanos

Pitas

Nikolaidis

2012

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Nikolaos Tsapanos

A distributed framework for trimmed Kernel k-Means clustering

Shape matching using a binary search tree structure of weak classifiers

Graph representations using adjacency matrix transforms for clustering

Contact Info

Product

Resources

About