“…Given its importance in real-world applications, the problem of learning from shifting distributions has been widely studied. Much past work focuses on a single shift between training and test data (Lu et al., 2021; Wang & Deng, 2018; Fakoor et al., 2020b), or on restricted forms of shift involving changes only in the features (Sugiyama et al., 2007a; Reddi et al., 2015a), only in the labels (Lipton et al., 2018; Garg et al., 2020; Alexandari et al., 2020), or in the underlying relationship between the two (Zhang et al., 2013; Lu et al., 2018). Distributions that evolve over time have been considered in the literature on concept drift (Gomes et al., 2019; Souza et al., 2020), reinforcement learning, where the shift is between the target policy and the behavior policy (Schulman et al., 2015; Wang et al., 2016; Fakoor et al., 2020a), (meta) online learning (Shalev-Shwartz, 2012; Finn et al., 2019; Harrison et al., 2020; Wu et al., 2021), and task-free continual/incremental learning (Aljundi et al., 2019; He et al., 2019). To our knowledge, however, existing methods for these settings do not employ time-varying data weights like those we propose here.…”
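To make the closing idea concrete: time-varying data weights could, for instance, down-weight older samples in a loss computed over a data stream. The sketch below is purely illustrative and is not the method from the text; the exponential-decay form and the `half_life` parameter are assumptions chosen for the example.

```python
import numpy as np

def time_decay_weights(timestamps, t_now, half_life=10.0):
    """Assign each sample a weight that halves every `half_life` time units,
    so recent observations dominate the weighted loss."""
    age = t_now - np.asarray(timestamps, dtype=float)
    return 0.5 ** (age / half_life)

# Toy stream: five samples observed at times 0..4, weighted at t_now = 4.
w = time_decay_weights([0, 1, 2, 3, 4], t_now=4, half_life=2.0)

# Weighted squared-error loss over the stream: newer samples count more.
preds = np.array([0.9, 1.1, 2.0, 2.8, 4.2])
targets = np.array([1.0, 1.0, 2.0, 3.0, 4.0])
loss = np.sum(w * (preds - targets) ** 2) / np.sum(w)
```

Under a distribution that drifts smoothly in time, such a scheme trades off sample efficiency (using all past data) against bias (stale data reflecting an outdated distribution), with `half_life` controlling that trade-off.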