Proceedings of the 22nd Conference on Computational Natural Language Learning 2018
DOI: 10.18653/v1/k18-1015
Active Learning for Interactive Neural Machine Translation of Data Streams

Abstract: We study the application of active learning techniques to the translation of unbounded data streams via interactive neural machine translation. The main idea is to select, from an unbounded stream of source sentences, those worth being supervised by a human agent. The user interactively translates those samples. Once validated, these data are useful for adapting the neural machine translation model. We propose two novel methods for selecting the samples to be validated. We exploit the information from the at…
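The selection step the abstract describes — deciding, sentence by sentence, which items from an unbounded stream merit human supervision — can be illustrated with a generic uncertainty-based sketch. This is not the paper's actual selection method; the function names and the threshold value are illustrative assumptions.

```python
def sequence_uncertainty(token_logprobs):
    """Length-normalized negative log-probability of a hypothesis.

    Higher values indicate lower model confidence in its own translation.
    """
    return -sum(token_logprobs) / max(len(token_logprobs), 1)


def select_for_supervision(stream, translate, threshold=0.7):
    """Route low-confidence translations from an unbounded stream to a human.

    `translate` is assumed to return (hypothesis, per-token log-probs).
    Yields (source, hypothesis, needs_human) so confident hypotheses can be
    kept automatically while uncertain ones go to the interactive translator.
    """
    for src in stream:
        hyp, logprobs = translate(src)
        needs_human = sequence_uncertainty(logprobs) > threshold
        yield src, hyp, needs_human
```

Because the stream is unbounded, the selector works one sentence at a time and never needs the full pool in memory, which is the key difference from classical pool-based active learning.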

Cited by 32 publications (49 citation statements)
References 39 publications
“…Considering the ε-greedy-like strategy of the regulator and the strong role of the cost factor shown in Figure 4, the regulator module does not appear to choose individual actions based, e.g., on the difficulty of inputs, but rather composes mini-batches with a feedback ratio according to the feedback type's statistics. This confirms the observations of Peris and Casacuberta (2018), who find that the subset of instances selected for labeling is secondary; it is rather the mixing ratio of feedback types that matters. This finding is also consistent with the mini-batch update regime that forces the regulator to take a higher-level perspective and optimize the expected improvement at the granularity of (mini-batch) updates rather than at the input level.…”
Section: Results (supporting)
confidence: 88%
“…A key component of AL is the choice of the sampling strategy, which curates the samples in order to maximize the model's performance with a minimum amount of user interaction. Many AL sampling strategies have proven effective for human-supervised natural language processing tasks other than compression (Hahn et al., 2012; Peris and Casacuberta, 2018; Liu et al., 2018).…”
Section: Introduction (mentioning)
confidence: 99%
“…Following the spirit of the Keras library, we developed NMT-Keras, released under the MIT license, which aims to provide a highly modular and extensible framework for NMT. https://github.com/lvapeab/nmt-keras NMT-Keras supports advanced features, including interactive-predictive NMT (INMT) protocols (Barrachina et al., 2009; Peris et al., 2017c), continuous adaptation (Peris et al., 2017a), and active learning (Peris and Casacuberta, 2018b) strategies. An additional goal is to ease the usage of the library while allowing the user to configure most of the options involved in the NMT process.…”
Section: Introduction (mentioning)
confidence: 99%