Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
DOI: 10.18653/v1/2021.acl-long.226

Lightweight Cross-Lingual Sentence Representation Learning

Abstract: Large-scale models for learning fixed-dimensional cross-lingual sentence representations like LASER (Artetxe and Schwenk, 2019b) lead to significant improvements in performance on downstream tasks. However, further increases and modifications based on such large-scale models are usually impractical due to memory limitations. In this work, we introduce a lightweight dual-transformer architecture with just 2 layers for generating memory-efficient cross-lingual sentence representations. We explore different training…
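The core idea in the abstract, a shallow 2-layer transformer that produces a fixed-dimensional sentence embedding, can be sketched as follows. This is a minimal illustration assuming PyTorch; the vocabulary size, model dimension, head count, and mean-pooling choice are illustrative assumptions, not details confirmed by the paper.

import torch
import torch.nn as nn

class LightweightSentenceEncoder(nn.Module):
    # A shallow (2-layer) transformer encoder that mean-pools token states
    # into a fixed-dimensional sentence embedding. All sizes are illustrative.
    def __init__(self, vocab_size=50000, d_model=256, nhead=4, num_layers=2):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model, padding_idx=0)
        layer = nn.TransformerEncoderLayer(
            d_model, nhead, dim_feedforward=4 * d_model, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=num_layers)

    def forward(self, token_ids):
        # token_ids: (batch, seq_len), with 0 as the padding index
        pad_mask = token_ids.eq(0)
        hidden = self.encoder(self.embed(token_ids),
                              src_key_padding_mask=pad_mask)
        # Average over non-padding positions only
        hidden = hidden.masked_fill(pad_mask.unsqueeze(-1), 0.0)
        lengths = (~pad_mask).sum(dim=1, keepdim=True).clamp(min=1)
        return hidden.sum(dim=1) / lengths

In a dual-transformer setup, the same (or a weight-shared) encoder embeds both the source and target sentence, and the two fixed-dimensional embeddings are compared directly, e.g. for cross-lingual sentence retrieval.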

Cited by 3 publications (1 citation statement); references 29 publications.
“…However, manually cleaned high-quality ground-truth bilingual dictionaries, which are unavailable for most language pairs, are used to pre-edit the source sentences. Recently, contrastive objectives (Clark et al., 2020; Gunel et al., 2021; Giorgi et al., 2021; Wei et al., 2021; Mao et al., 2021) have been shown to be superior at leveraging alignment knowledge in various NLP tasks by contrasting the representations of positive and negative samples in a discriminative manner. This objective, which should be able to utilize word alignment learned by any toolkit and thereby remove the constraint of using manually constructed dictionaries, has not been explored in the context of leveraging word alignment for many-to-many NMT.…”
Section: Introduction
Mentioning confidence: 99%
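The contrastive objective the citing work refers to can be illustrated with a standard InfoNCE-style loss over in-batch negatives. This is a generic sketch under that assumption, not the exact loss used in any of the cited papers; the function name and temperature value are illustrative placeholders.

import torch
import torch.nn.functional as F

def contrastive_loss(src_emb, tgt_emb, temperature=0.05):
    # src_emb, tgt_emb: (batch, dim) embeddings of aligned sentence pairs.
    src = F.normalize(src_emb, dim=-1)
    tgt = F.normalize(tgt_emb, dim=-1)
    # Cosine-similarity matrix: the diagonal holds the positive pairs,
    # every off-diagonal entry serves as an in-batch negative.
    logits = src @ tgt.t() / temperature
    labels = torch.arange(src.size(0), device=src.device)
    # Symmetric cross-entropy: align source-to-target and target-to-source.
    return 0.5 * (F.cross_entropy(logits, labels)
                  + F.cross_entropy(logits.t(), labels))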