2014
DOI: 10.1016/j.tcs.2013.09.027

Domain adaptation and sample bias correction theory and algorithm for regression

Abstract: We present a series of new theoretical, algorithmic, and empirical results for domain adaptation and sample bias correction in regression. We prove that the discrepancy is a distance for the squared loss when the hypothesis set is the reproducing kernel Hilbert space induced by a universal kernel such as the Gaussian kernel. We give new pointwise loss guarantees based on the discrepancy of the empirical source and target distributions for the general class of kernel-based regularization algorithms. These bound…
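The "kernel-based regularization algorithms" covered by these guarantees include, for instance, kernel ridge regression with per-sample weights multiplying the training losses, which is how sample bias correction weights enter the learning step. The sketch below is not code from the paper; it is a minimal weighted kernel ridge regression under a Gaussian kernel, with illustrative names (`gaussian_kernel`, `weighted_krr_fit`, `lam`, `gamma`) and arbitrarily chosen parameters.

```python
import numpy as np

def gaussian_kernel(X, Z, gamma=1.0):
    """Gaussian (RBF) kernel matrix between the rows of X and the rows of Z."""
    d2 = ((X[:, None, :] - Z[None, :, :]) ** 2).sum(-1)
    return np.exp(-gamma * d2)

def weighted_krr_fit(X, y, q, lam=1e-2, gamma=1.0):
    """Weighted kernel ridge regression:
    minimize sum_i q_i (h(x_i) - y_i)^2 + lam * ||h||_K^2 over the RKHS of K.
    Returns dual coefficients alpha with h(x) = sum_i alpha_i K(x_i, x)."""
    K = gaussian_kernel(X, X, gamma)
    # Stationarity of the objective gives (diag(q) K + lam I) alpha = diag(q) y.
    alpha = np.linalg.solve(np.diag(q) @ K + lam * np.eye(len(y)), q * y)
    return alpha

def weighted_krr_predict(alpha, X_train, X_new, gamma=1.0):
    """Evaluate the fitted hypothesis on new points."""
    return gaussian_kernel(X_new, X_train, gamma) @ alpha
```

Uniform weights `q = np.full(m, 1.0 / m)` recover ordinary kernel ridge regression; non-uniform weights are where a bias-correction or discrepancy-minimization scheme plugs in.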


Cited by 118 publications (130 citation statements)
References 18 publications
“…The discrepancy has several advantages over a measure such as the $L_1$ or total variation distance (Cortes and Mohri, 2013): it is a finer measure than the $L_1$ distance, it takes into account the loss function and the hypothesis set, it can be accurately estimated from finite samples for common hypothesis sets such as kernel-based ones, it is symmetric and verifies the triangle inequality. It further defines a distance in the case of an $L_p$ loss used with a universal kernel such as a Gaussian kernel.…”
Section: Previous Work
confidence: 99%
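To make the "accurately estimated from finite samples" point concrete, here is a minimal sketch, not taken from either paper, assuming the squared loss and a linear hypothesis set $\{x \mapsto w \cdot x : \|w\| \le \Lambda\}$ (a finite-dimensional stand-in for the kernel-based sets mentioned in the quote). Under these assumptions the empirical discrepancy reduces to $4\Lambda^2$ times the spectral norm of the difference of the two empirical second-moment matrices; `Lambda`, `X_src`, and `X_tgt` are illustrative names.

```python
import numpy as np

def empirical_discrepancy(X_src, X_tgt, Lambda=1.0):
    """Empirical discrepancy for the squared loss and the hypothesis set
    {x -> w.x : ||w|| <= Lambda}.  In this setting
    disc = 4 * Lambda^2 * || M_src - M_tgt ||_2,
    where M_src, M_tgt are the empirical second-moment matrices and
    ||.||_2 is the spectral norm."""
    M_src = X_src.T @ X_src / X_src.shape[0]
    M_tgt = X_tgt.T @ X_tgt / X_tgt.shape[0]
    # Spectral norm of a symmetric matrix = largest eigenvalue magnitude.
    return 4.0 * Lambda**2 * np.max(np.abs(np.linalg.eigvalsh(M_src - M_tgt)))

# Toy usage on two samples drawn from shifted distributions.
rng = np.random.default_rng(0)
X_src = rng.normal(0.0, 1.0, size=(200, 5))
X_tgt = rng.normal(0.5, 1.0, size=(150, 5))
print(empirical_discrepancy(X_src, X_tgt))
```

Both second-moment matrices are computed from unlabeled samples only, which is what makes the discrepancy estimable without target labels.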
“…Using $q_{\min}$ instead of $Q$ amounts to reweighting the loss on the training samples to minimize the discrepancy between the empirical distribution and $P$. Besides its theoretical motivation, this algorithm has been shown to outperform several other algorithms in a series of experiments carried out by Cortes and Mohri (2013). Observe that, by definition, the solution $q_{\min}$ of discrepancy minimization is obtained by minimizing a maximum over all pairs of hypotheses, that is, $\max_{h, h' \in H} |L_P(h, h') - L_{q_{\min}}(h, h')|$.…”
Section: Previous Work
confidence: 99%
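The cited paper formulates discrepancy minimization as a convex optimization problem; the following is only a projected-subgradient heuristic for the same objective under the linear-hypothesis, squared-loss assumption used in the previous sketch, where $\max_{h,h' \in H}|L_P(h,h') - L_q(h,h')|$ reduces to the spectral norm $\|\sum_i q_i x_i x_i^\top - M_{\text{tgt}}\|_2$. All names (`discrepancy_min_weights`, `step`, `n_iter`) are illustrative, not from the paper.

```python
import numpy as np

def project_simplex(q):
    """Euclidean projection of a vector onto the probability simplex."""
    u = np.sort(q)[::-1]
    css = np.cumsum(u)
    rho = np.nonzero(u * np.arange(1, len(q) + 1) > (css - 1.0))[0][-1]
    theta = (css[rho] - 1.0) / (rho + 1.0)
    return np.maximum(q - theta, 0.0)

def discrepancy_min_weights(X_src, X_tgt, n_iter=500, step=0.1):
    """Heuristic discrepancy minimization: find simplex weights q on the
    source points minimizing || sum_i q_i x_i x_i^T - M_tgt ||_2."""
    m = X_src.shape[0]
    M_tgt = X_tgt.T @ X_tgt / X_tgt.shape[0]
    q = np.full(m, 1.0 / m)                      # start from uniform weights
    for _ in range(n_iter):
        A = (X_src * q[:, None]).T @ X_src - M_tgt
        eigvals, eigvecs = np.linalg.eigh(A)
        k = np.argmax(np.abs(eigvals))           # eigenvalue of largest magnitude
        v = eigvecs[:, k]
        # Subgradient of ||A(q)||_2 w.r.t. q_i is sign(lambda_k) * (v . x_i)^2.
        g = np.sign(eigvals[k]) * (X_src @ v) ** 2
        q = project_simplex(q - step * g)
    return q
```

The returned weights would then multiply the training losses, e.g. as the `q` argument of the weighted kernel ridge regression sketched after the abstract above.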
“…This behavior is generally shared by all supervised approaches to bio/geophysical parameter retrieval and suggests that, before applying the trained estimator to geographical regions distinct from those where the training samples were located, additional testing with data collected from those regions is expected to be necessary. The combination with domain adaptation techniques (e.g., [67]) could also be an interesting extension aimed at enabling application to areas without training samples.…”
Section: Discussion
confidence: 99%