2011
DOI: 10.1007/s10115-010-0375-z
|View full text |Cite
|
Sign up to set email alerts
|

Cluster-based instance selection for machine classification

Abstract: Instance selection in the supervised machine learning, often referred to as the data reduction, aims at deciding which instances from the training set should be retained for further use during the learning process. Instance selection can result in increased capabilities and generalization properties of the learning model, shorter time of the learning process, or it can help in scaling up to large data sources. The paper proposes a cluster-based instance selection approach with the learning process executed by … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
24
0
2

Year Published

2011
2011
2019
2019

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 57 publications
(26 citation statements)
references
References 41 publications
0
24
0
2
Order By: Relevance
“…Selection of prototypes from clusters is one of the promising approaches to obtain "good" representatives of such clusters [50].…”
Section: Approaches Based On Similarity Measuresmentioning
confidence: 99%
“…Selection of prototypes from clusters is one of the promising approaches to obtain "good" representatives of such clusters [50].…”
Section: Approaches Based On Similarity Measuresmentioning
confidence: 99%
“…Copyright: the authorsnested series of partitions for merging or splitting clusters based on the similarity [103]. Partitional clustering algorithms identify the partition that optimises a clustering criterion.…”
Section: Co-published By Atlantis Press and Taylor And Francismentioning
confidence: 99%
“…In his work, Czarnowski uses the similarity coefficient to identify clusters of instances in one of his object selection approaches and uses the Silhouette Coefficient measure to evaluate the cluster quality of each of his methods [3]. A single instance was selected as a prototype of the class and used for training purposes.…”
Section: Related Workmentioning
confidence: 99%
“…The algorithms chosen to compute the centroids can affect their quality and thus, have an impact on the prototype selection. Again, for large datasets, when the number of features is large, ""the quality of the centroids can be poor" [3].…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation