Ievgen Redko scite author profile

Domain adaptation (DA) is an important and emerging field of machine learning that tackles the problem occurring when the distributions of training (source domain) and test (target domain) data are similar but different. This kind of learning paradigm is of vital importance for future advances as it allows a learner to generalize the knowledge across different tasks. Current theoretical results show that the efficiency of DA algorithms depends on their capacity of minimizing the divergence between source and target probability distributions. In this paper, we provide a theoretical study on the advantages that concepts borrowed from optimal transportation theory [17] can bring to DA. In particular, we show that the Wasserstein metric can be used as a divergence measure between distributions to obtain generalization guarantees for three different learning settings: (i) classic DA with unsupervised target data (ii) DA combining source and target labeled data, (iii) multiple source DA. Based on the obtained results, we motivate the use of the regularized optimal transport and provide some algorithmic insights for multi-source domain adaptation. We also show when this theoretical analysis can lead to tighter inequalities than those of other existing frameworks. We believe that these results open the door to novel ideas and directions for DA.

show abstract

A survey on domain adaptation theory: learning bounds and theoretical guarantees

Redko¹,

Morvant²,

Habrard³

et al. 2020

Preprint

View full text Add to dashboard Cite

Iterative Domain Adaptation Methods

Redko¹,

Habrard²,

Morvant³

et al. 2019

View full text Add to dashboard Cite

Feature Selection for Unsupervised Domain Adaptation Using Optimal Transport

Gautheron

Redko

Lartizien

2019

View full text Add to dashboard Cite

In this paper, we propose a new feature selection method for unsupervised domain adaptation based on the emerging optimal transportation theory. We build upon a recent theoretical analysis of optimal transport in domain adaptation and show that it can directly suggest a feature selection procedure leveraging the shift between the domains. Based on this, we propose a novel algorithm that aims to sort features by their similarity across the source and target domains, where the order is obtained by analyzing the coupling matrix representing the solution of the proposed optimal transportation problem. We evaluate our method on a well-known benchmark data set and illustrate its capability of selecting correlated features leading to better classification performances. Furthermore, we show that the proposed algorithm can be used as a pre-processing step for existing domain adaptation techniques ensuring an important speed-up in terms of the computational time while maintaining comparable results. Finally, we validate our algorithm on clinical imaging databases for computer-aided diagnosis task with promising results. 512 512 4096A→C 74.9±2.0 29.8±2.4 71.7±3.5

show abstract

Improving Few-Shot Learning through Multi-task Representation Learning Theory

Bouniot¹,

Redko²,

Audigier³

et al. 2020

Preprint

View full text Add to dashboard Cite

Cross-Lingual Document Retrieval Using Regularized Wasserstein Distance

Balikas

Laclau

Redko

et al. 2018

View full text Add to dashboard Cite

Many information retrieval algorithms rely on the notion of a good distance that allows to efficiently compare objects of different nature. Recently, a new promising metric called Word Mover's Distance was proposed to measure the divergence between text passages. In this paper, we demonstrate that this metric can be extended to incorporate term-weighting schemes and provide more accurate and computationally efficient matching between documents using entropic regularization. We evaluate the benefits of both extensions in the task of cross-lingual document retrieval (CLDR). Our experimental results on eight CLDR problems suggest that the proposed methods achieve remarkable improvements in terms of Mean Reciprocal Rank compared to several baselines.

show abstract

Optimal Transport for Multi-source Domain Adaptation under Target Shift

Redko¹,

Courty²,

Flamary³

et al. 2018

Preprint

View full text Add to dashboard Cite

In this paper, we tackle the problem of reducing discrepancies between multiple domains, i.e. multi-source domain adaptation, and consider it under the target shift assumption: in all domains we aim to solve a classification problem with the same output classes, but with different labels proportions. This problem, generally ignored in the vast majority of domain adaptation papers, is nevertheless critical in real-world applications, and we theoretically show its impact on the success of the adaptation. Our proposed method is based on optimal transport, a theory that has been successfully used to tackle adaptation problems in machine learning. The introduced approach, Joint Class Proportion and Optimal Transport (JCPOT), performs multi-source adaptation and target shift correction simultaneously by learning the class probabilities of the unlabeled target sample and the coupling allowing to align two (or more) probability distributions. Experiments on both synthetic and real-world data (satellite image pixel classification) task show the superiority of the proposed method over the state-of-the-art.

show abstract

Domain Adaptation Problem

Redko¹,

Habrard²,

Morvant³

et al. 2019

View full text Add to dashboard Cite

12 3 4

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Ievgen Redko

Theoretical Analysis of Domain Adaptation with Optimal Transport

A survey on domain adaptation theory: learning bounds and theoretical guarantees

Iterative Domain Adaptation Methods

Feature Selection for Unsupervised Domain Adaptation Using Optimal Transport

Improving Few-Shot Learning through Multi-task Representation Learning Theory

Cross-Lingual Document Retrieval Using Regularized Wasserstein Distance

Optimal Transport for Multi-source Domain Adaptation under Target Shift

Domain Adaptation Problem

Contact Info

Product

Resources

About