2016
DOI: 10.1016/j.neucom.2016.04.018
|View full text |Cite
|
Sign up to set email alerts
|

On the suitability of Prototype Selection methods for kNN classification with distributed data

Abstract: In the current Information Age, data production and processing demands are ever increasing. This has motivated the appearance of large-scale distributed information. This phenomenon also applies to Pattern Recognition so that classic and common algorithms, such as the k-Nearest Neighbour, are unable to be used. To improve the efficiency of this classifier, Prototype Selection (PS) strategies can be used. Nevertheless, current PS algorithms were not designed to deal with distributed data, and their performance … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
7
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
7
1

Relationship

3
5

Authors

Journals

citations
Cited by 14 publications
(7 citation statements)
references
References 34 publications
0
7
0
Order By: Relevance
“…Finally, we decided to use an additional metric that relates the performance and degree of reduction: the so-called estimated profit per prototype (Valero-Mas et al 2016). This measure is defined as the ratio between the classification rate and the number of distances computed or, in this context, the number of elements in the training set.…”
Section: Metricsmentioning
confidence: 99%
“…Finally, we decided to use an additional metric that relates the performance and degree of reduction: the so-called estimated profit per prototype (Valero-Mas et al 2016). This measure is defined as the ratio between the classification rate and the number of distances computed or, in this context, the number of elements in the training set.…”
Section: Metricsmentioning
confidence: 99%
“…Some research has focused on this issue to address the problem. For example, Valero-Mas et al [16] have proposed the prototype selection strategies that could be used to develop KNN classification for distributed data.…”
Section: Related Workmentioning
confidence: 99%
“…Finally, in order to provide a single value which relates both the performance and reduction capabilities of the strategies considered, we also consider the estimated profit per prototype measure defined as the ratio between the classification accuracy and the total number of distances computed [30]. It must be mentioned that, for its use in this work, this metric was slightly adapted from its original definition by considering the F 1 instead of the classification accuracy as well as the resulting set size instead of the number of distances computed.…”
Section: Performance Measurementmentioning
confidence: 99%