Twenty-First International Conference on Machine Learning - ICML '04 2004
DOI: 10.1145/1015330.1015349
|View full text |Cite
|
Sign up to set email alerts
|

Active learning using pre-clustering

Abstract: The paper is concerned with two-class active learning. While the common approach for collecting data in active learning is to select samples close to the classification boundary, better performance can be achieved by taking into account the prior data distribution. The main contribution of the paper is a formal framework that incorporates clustering into active learning. The algorithm first constructs a classifier on the set of the cluster representatives, and then propagates the classification decision to the… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

3
241
0

Year Published

2011
2011
2021
2021

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 464 publications
(254 citation statements)
references
References 19 publications
3
241
0
Order By: Relevance
“…Nguyen and Smeulders [9] suggest a probabilistic framework where clustering information is incorporated into the active sampling scheme. They argue that data points lying on the classification boundary are informative, but using information about the underlying data distribution helps to select better examples.…”
Section: Motivation For Dualmentioning
confidence: 99%
See 4 more Smart Citations
“…Nguyen and Smeulders [9] suggest a probabilistic framework where clustering information is incorporated into the active sampling scheme. They argue that data points lying on the classification boundary are informative, but using information about the underlying data distribution helps to select better examples.…”
Section: Motivation For Dualmentioning
confidence: 99%
“…Nguyen and Smeulders [9] assume a clustering structure of the underlying data. x is the data and y ∈ {+1, 0} is the class label.…”
Section: Density Weighted Uncertainty Sampling (Dwus)mentioning
confidence: 99%
See 3 more Smart Citations