Siyu Lai scite author profile

Siyu Lai

3Publications

2Citation Statements Received

46Citation Statements Given

How they've been cited

How they cite others

Affiliations

Beijing Jiaotong University

Publications

Order By: Most citations

Saliency-based Multi-View Mixed Language Training for Zero-shot Cross-lingual Classification

Lai

Huang

Dong³

et al. 2021

View full text Add to dashboard Cite

Recent multilingual pre-trained models, like XLM-RoBERTa (XLM-R), have been demonstrated effective in many cross-lingual tasks. However, there are still gaps between the contextualized representations of similar words in different languages. To solve this problem, we propose a novel framework named Multi-View Mixed Language Training (MVMLT), which leverages code-switched data with multi-view learning to fine-tune XLM-R. MVMLT uses gradient-based saliency to extract keywords which are the most relevant to downstream tasks and replaces them with the corresponding words in the target language dynamically. Furthermore, MVMLT utilizes multiview learning to encourage contextualized embeddings to align into a more refined language-invariant space. Extensive experiments with four languages show that our model achieves state-of-the-art results on zeroshot cross-lingual sentiment classification and dialogue state tracking tasks, demonstrating the effectiveness of our proposed model 1 .

show abstract

Neural Machine Translation Based on Back-Translation for Multilingual Translation Evaluation Task

Lai

Yang

et al. 2020

View full text Add to dashboard Cite

Supervised Contrastive Learning for Cross-Lingual Transfer Learning

Wang

Di²,

Huang³

et al. 2022

View full text Add to dashboard Cite

Multilingual pre-trained representations are not well-aligned by nature, which harms their performance on cross-lingual tasks. Previous methods propose to post-align the multilingual pretrained representations by multi-view alignment or contrastive learning. However, we argue that both methods are not suitable for the cross-lingual classification objective, and in this paper we propose a simple yet effective method to better align the pre-trained representations. On the basis of cross-lingual data augmentations, we make a minor modification to the canonical contrastive loss, to remove false-negative examples which should not be contrasted. Augmentations with the same class are brought close to the anchor sample, and augmentations with different class are pushed apart. Experiment results on three cross-lingual tasks from XTREME benchmark show our method could improve the transfer performance by a large margin with no additional resource needed. We also provide in-detail analysis and comparison between different post-alignment strategies.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Siyu Lai

Saliency-based Multi-View Mixed Language Training for Zero-shot Cross-lingual Classification

Neural Machine Translation Based on Back-Translation for Multilingual Translation Evaluation Task

Supervised Contrastive Learning for Cross-Lingual Transfer Learning

Contact Info

Product

Resources

About