2018 24th International Conference on Pattern Recognition (ICPR)
DOI: 10.1109/icpr.2018.8545422

Dynamic Ensemble Active Learning: A Non-Stationary Bandit with Expert Advice

Abstract: Active learning aims to reduce annotation cost by predicting which samples are useful for a human teacher to label. However, it has become clear that there is no single best active learning algorithm: inspired by different philosophies about what constitutes a good criterion, different algorithms perform well on different datasets. This has motivated research into ensembles of active learners that learn what constitutes a good criterion in a given scenario, typically via multi-armed bandit algorithms. Though algorithm ensemble…

Cited by 16 publications (16 citation statements). References 29 publications (74 reference statements).

Citation statements:
“…Their approach performed in line with the best two individual query strategies and outperformed Hsu and Lin (2015). Pang et al (2018) proposed a modification of multi-armed bandits with expert advice to account for non-stationary loss functions (i.e., the best expert might vary over time) in binary and multi-class classification tasks. Their approach outperformed or performed in line with the best individual strategies, and outperformed both Baram, El-Yaniv, and Luz (2004) and Hsu and Lin (2015) on non-stationary datasets.…”
Section: Combining Online Learning and Active Learning
confidence: 99%
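
The bandit-with-expert-advice template behind these ensembles is compact enough to sketch. Below is a minimal, illustrative EXP4-style round in Python, in which each "expert" is an active learning criterion proposing a distribution over the unlabelled pool. This is not Pang et al's exact algorithm: the forgetting factor alpha (decaying old weights so a drifting best expert can be tracked), the constants eta and gamma, and the reward_fn hook are all assumptions of this sketch.

    import numpy as np

    rng = np.random.default_rng(0)

    def exp4_round(weights, advice, reward_fn, eta=0.5, gamma=0.1, alpha=0.99):
        # weights: (K,) one weight per expert (= per AL criterion).
        # advice:  (K, N) each expert's distribution over the N pool samples.
        K, N = advice.shape
        p_experts = weights / weights.sum()
        # Mix the experts' advice, reserving gamma mass for uniform exploration.
        p_arms = (1.0 - gamma) * (p_experts @ advice) + gamma / N
        arm = rng.choice(N, p=p_arms)
        reward = reward_fn(arm)  # e.g. gain in validation accuracy after labelling
        # Importance-weighted reward estimate keeps the update unbiased.
        r_hat = np.zeros(N)
        r_hat[arm] = reward / p_arms[arm]
        expert_gain = advice @ r_hat  # expected estimated reward per expert
        # Exponential update; raising old weights to alpha < 1 decays stale
        # evidence, one simple device for handling non-stationarity.
        weights = (weights ** alpha) * np.exp(eta * expert_gain / N)
        return weights / weights.sum(), arm
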
“…Since our end goal is to improve the MT ensemble, we measure its improvement at each weight update by considering the expected regret $R_M$ for not choosing the best MT system's translation at each iteration, up to the current iteration $T$ (Eq. 9), which can be seen as a dynamic regret (Pang et al 2018). Note that this regret formulation deviates from the traditional formulation, in that we compare the forecaster to the best sequence of decisions overall (whose cumulative loss is given by $\sum_{t=1}^{T} \min_{j=1,\dots,J} \ell_{j,t}$), instead of the best expert overall (whose cumulative loss would be given by $\min_{j=1,\dots,J} \sum_{t=1}^{T} \ell_{j,t}$).…”
Section: Figure
confidence: 99%
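
To make the contrast in that statement concrete, here is a toy numerical check (all loss values invented for illustration): dynamic regret benchmarks the forecaster against the best decision at every round, static regret against the single best expert in hindsight, so the dynamic notion is always at least as large.

    import numpy as np

    # losses[t, j] = loss of expert j at round t (made-up numbers).
    losses = np.array([[0.2, 0.9],
                       [0.8, 0.1],
                       [0.3, 0.7]])
    forecaster_loss = np.array([0.4, 0.3, 0.4])  # forecaster's loss per round

    # Dynamic regret: compare to the best sequence of decisions,
    # sum_t min_j losses[t, j] = 0.2 + 0.1 + 0.3 = 0.6.
    dynamic = forecaster_loss.sum() - losses.min(axis=1).sum()   # 1.1 - 0.6 =  0.5

    # Traditional (static) regret: compare to the best single expert,
    # min_j sum_t losses[t, j] = min(1.3, 1.7) = 1.3.
    static = forecaster_loss.sum() - losses.sum(axis=0).min()    # 1.1 - 1.3 = -0.2
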
“…Many other data-driven approaches for pool-based AL processes have been proposed recently. While Bachman et al (2017) and Pang et al (2018b) used RL to build the learning model, Liu et al (2018) formulated learning AL strategies as an imitation learning problem (i.e., the machine is trained to perform a task from demonstrations by learning a mapping between observations and actions), while Contardo et al (2017) and Ravi and Larochelle (2018) … Pang et al (2018a) extended the LSA approach using a non-stationary multi-armed bandit with expert advice.…”
Section: Strategy-free Approaches
confidence: 99%
“…Conversely, selecting an example in an already sampled region makes it possible to locally refine the predictive model. We do not intend to provide an exhaustive overview of existing AL strategies; we refer to [37], [38] for a detailed overview, to [39]-[41] for some recent benchmarks, and to [42] for a new way to treat uncertainty. Another meta active learning paradigm exists, which combines conventional strategies using bandit algorithms [43]-[48]. These meta-learning algorithms aim to select online the best AL strategy according to the observed improvements of the classifier, as sketched below.…”
Section: B. Axis 2: Inexact Supervision - Labels at the Right Proxy vs ...
confidence: 99%
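
That last idea, selecting online the best AL strategy from the observed improvements of the classifier, fits in a short sketch. The following epsilon-greedy loop stands in for the bandit algorithms cited in [43]-[48]; every name, constant, and the reward definition (change in held-out accuracy) is illustrative, and the random seed set is assumed to contain both classes.

    import numpy as np
    from sklearn.linear_model import LogisticRegression
    from sklearn.metrics import accuracy_score

    def meta_active_learning(X_pool, y_pool, X_val, y_val, strategies,
                             budget=50, eps=0.1, seed=0):
        # strategies: callables (model, X_unlabelled) -> local index to query.
        rng = np.random.default_rng(seed)
        labelled = list(rng.choice(len(X_pool), size=5, replace=False))
        value = np.zeros(len(strategies))   # running mean reward per strategy
        counts = np.zeros(len(strategies))
        model = LogisticRegression().fit(X_pool[labelled], y_pool[labelled])
        prev_acc = accuracy_score(y_val, model.predict(X_val))
        for _ in range(budget):
            unlabelled = [i for i in range(len(X_pool)) if i not in labelled]
            # epsilon-greedy choice of strategy, standing in for EXP4 and kin.
            s = (int(rng.integers(len(strategies))) if rng.random() < eps
                 else int(np.argmax(value)))
            i = unlabelled[strategies[s](model, X_pool[unlabelled])]
            labelled.append(i)  # oracle call: reveal y_pool[i]
            model = LogisticRegression().fit(X_pool[labelled], y_pool[labelled])
            acc = accuracy_score(y_val, model.predict(X_val))
            counts[s] += 1
            value[s] += (acc - prev_acc - value[s]) / counts[s]
            prev_acc = acc
        return model, value

An uncertainty-sampling expert matching this interface could be as simple as lambda m, X: int(np.argmin(np.abs(m.predict_proba(X)[:, 1] - 0.5))).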