Proceedings of the 25th International Conference on Machine Learning - ICML '08 2008
DOI: 10.1145/1390156.1390254
|View full text |Cite
|
Sign up to set email alerts
|

Estimating labels from label proportions

Abstract: Consider the following problem: given sets of unlabeled observations, each set with known label proportions, predict the labels of another set of observations, also with known label proportions. This problem appears in areas like e-commerce, spam filtering and improper content detection. We present consistent estimators which can reconstruct the correct labels with high probability in a uniform convergence sense. Experiments show that our method works well in practice.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
163
0

Year Published

2011
2011
2019
2019

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 108 publications
(165 citation statements)
references
References 14 publications
2
163
0
Order By: Relevance
“…Additionally, we will consider combining labeled and unlabeled data using semi-supervised learning from label proportions (Quadrianto et al, 2009;Ganchev et al, 2010;Mann and McCallum, 2010).…”
Section: Discussionmentioning
confidence: 99%
“…Additionally, we will consider combining labeled and unlabeled data using semi-supervised learning from label proportions (Quadrianto et al, 2009;Ganchev et al, 2010;Mann and McCallum, 2010).…”
Section: Discussionmentioning
confidence: 99%
“…We have compared the LLP algorithm to three state-of-the-art methods for learning from label proportions: The Mean Map method [19], Inverse Calibration (Invcal) [21] and AOC Kernel k-Means (AOC-KK) [6]. For a further discussion of these methods, see Sect.…”
Section: Methodsmentioning
confidence: 99%
“…Quadrianto et al [19] have proposed the Mean Map method which estimates the conditional class probability P (Y |X, θ) by conditional exponential models, using a feature map Φ(X, Y ) and a normalization function g:…”
Section: Related Workmentioning
confidence: 99%
“…But naive grid labeling provide too little information (only the label of the major class in a cell) on sample classes, which will inevitably introduce label uncertainty and eventually reduce the classification accuracy. According to the idea of learning with label proportions [9][10][11], we can learn a model to predict labels of the individual samples by grouping the training samples and providing proportions of the labels in each group. However, the current definition of label proportion [11] ignores spatial sources of samples in each group, which means samples in a group are not necessarily from the same local region, making it not convenient for sample labeling of remote sensing images.…”
Section: Proportional Grid Labelingmentioning
confidence: 99%
“…The issue of learning from label proportions is raising attentions in the machine learning area [9][10][11][12]. For learning from label proportions, the training samples are divided into groups and label proportions of samples in each group are given as sample truth, instead of giving the label of each sample in the training set [11].…”
Section: Introductionmentioning
confidence: 99%