Adversarial Learning with Contextual Embeddings for Zero-resource Cross-lingual Classification and NER

Keung, Phillip; Lu, Yichao; Bhardwaj, Vikas

doi:10.18653/v1/d19-1138

Cited by 56 publications

(91 citation statements)

References 18 publications

(21 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Artetxe et al [1] pretrain a massively multilingual sequence-to-sequence neural MT model, invoking its encoder as a multilingual text representation used for fine-tuning on downstream tasks. Keung et al [10] apply languageadversarial learning into Multilingual BERT during fine-tuning with unlabeled data. We also considered Multilingual BERT and itself with self-learning and adversarial training respectively as our baselines.…”

Section: Results and Analysismentioning

confidence: 99%

Leveraging Adversarial Training in Self-Learning for Cross-Lingual Text Classification

Dong

Zhu

Zhang

et al. 2020

Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval

View full text Add to dashboard Cite

In cross-lingual text classification, one seeks to exploit labeled data from one language to train a text classification model that can then be applied to a completely different language. Recent multilingual representation models have made it much easier to achieve this. Still, there may still be subtle differences between languages that are neglected when doing so. To address this, we present a semisupervised adversarial training process that minimizes the maximal loss for label-preserving input perturbations. The resulting model then serves as a teacher to induce labels for unlabeled target language samples that can be used during further adversarial training, allowing us to gradually adapt our model to the target language. Compared with a number of strong baselines, we observe significant gains in effectiveness on document and intent classification for a diverse set of languages. CCS CONCEPTS • Computing methodologies → Natural language processing; • Information systems → Document representation.

show abstract

Section: Results and Analysismentioning

confidence: 99%

Leveraging Adversarial Training in Self-Learning for Cross-Lingual Text Classification

Dong

Zhu

Zhang

et al. 2020

Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval

View full text Add to dashboard Cite

show abstract

“…They leverage the benefit of contextualized word embeddings by using multilingual BERT (Devlin et al, 2019) as the feature generator, and adopt the GAN framework (Goodfellow et al, 2014) to align the features from the two domains. Keung et al (2019) show significant improvement over the baseline where the pretrained multilingual BERT is finetuned on the English data alone and testing on the same tasks in other languages. However, Keung et al (2019), as well as the works mentioned above, are inspired by the pioneering work of Ben-David et al (2010), which only rigorously studies domain adaptation in the setting of binary classification; there is a lack of theoretical guarantees when it comes to multiclass classification.…”

Section: Introductionmentioning

confidence: 89%

“…Unsupervised domain adaptation provides an appealing solution to many applications where direct access to a massive amount of labeled data is prohibitive or very costly (Sun and Saenko, 2014;Vazquez et al, 2013;Stark et al, 2010;Keung et al, 2019). For example, we often have sufficient labeled data for English, while very limited or even no labeled data are available for many other languages.…”

Section: Introductionmentioning

confidence: 99%

“…Many recent successes in unsupervised domain adaptation have been achieved by learning domain invariant features that are simultaneously being discriminative to the task in the source domain (Chen et al, 2018;Ganin and Lempitsky, 2014;Ganin et al, 2016;Tzeng et al, 2017). Following this line, Keung et al (2019) propose a language-adversarial training approach for cross-lingual document classification and NER. They leverage the benefit of contextualized word embeddings by using multilingual BERT (Devlin et al, 2019) as the feature generator, and adopt the GAN framework (Goodfellow et al, 2014) to align the features from the two domains.…”

Section: Introductionmentioning

confidence: 99%

“…In this work, we are instead motivated by a recent work (Zhang et al, 2019b) that focuses on the theoretical analysis of unsupervised domain adaption for multiclass classification and provides explicit guidance for algorithm design. Instead of training a discriminator that predicts if the representations are from the source domain or the target domain (Keung et al, 2019;Ganin and Lempitsky, 2014;Ganin et al, 2016), Zhang et al (2019b) proposes to optimize an auxiliary classifier which, together with the classifier, minimizes the discrepancy between the two domains via adversarial training. We apply this approach to cross-lingual text labeling tasks, which, as demonstrated in Section 4, outperforms Keung et al (2019) by a large margin.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Margin-aware Unsupervised Domain Adaptation for Cross-lingual Text Labeling

Zhang

Nallapati

Zhu

et al. 2020

Findings of the Association for Computational Linguistics: EMNLP 2020

View full text Add to dashboard Cite

Unsupervised domain adaptation addresses the problem of leveraging labeled data in a source domain to learn a well-performing model in a target domain where labels are unavailable. In this paper, we improve upon a recent theoretical work (Zhang et al., 2019b) and adopt the Margin Disparity Discrepancy (MDD) unsupervised domain adaptation algorithm to solve the cross-lingual text labeling problems. Experiments on cross-lingual document classification and NER demonstrate the proposed domain adaptation approach advances the stateof-the-art results by a large margin. Specifically, we improve MDD by efficiently optimizing the margin loss on the source domain via Virtual Adversarial Training (VAT). This bridges the gap between theory and the loss function used in the original work Zhang et al. (2019b), and thereby significantly boosts the performance. Our numerical results also indicate that VAT can remarkably improve the generalization performance of both domains for various domain adaptation approaches.

show abstract

Cross-Lingual Name Entity Recognition from Clinical Text Using Mixed Language Query

Shi,

Chen,

et al. 2024

Communications in Computer and Information Science

View full text Add to dashboard Cite

Adversarial Learning with Contextual Embeddings for Zero-resource Cross-lingual Classification and NER

Cited by 56 publications

References 18 publications

Leveraging Adversarial Training in Self-Learning for Cross-Lingual Text Classification

Leveraging Adversarial Training in Self-Learning for Cross-Lingual Text Classification

Margin-aware Unsupervised Domain Adaptation for Cross-lingual Text Labeling

Cross-Lingual Name Entity Recognition from Clinical Text Using Mixed Language Query

Contact Info

Product

Resources

About