Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021
DOI: 10.18653/v1/2021.findings-acl.267
AugVic: Exploiting BiText Vicinity for Low-Resource NMT

Abstract: The success of Neural Machine Translation (NMT) largely depends on the availability of large bitext training corpora. Due to the lack of such large corpora in low-resource language pairs, NMT systems often exhibit poor performance. Extra relevant monolingual data often helps, but acquiring it could be quite expensive, especially for low-resource languages. Moreover, domain mismatch between bitext (train/test) and monolingual data might degrade the performance. To alleviate such issues, we propose AUGVIC, a nov…

Cited by 4 publications (4 citation statements); all citing publications appeared in 2022.
References 43 publications.
“…Kiyono 2021, Karpukhin et al 2019). An advantage of soft replacements over hard ones is that they take into account the context of the tokens being replaced (Liu et al, 2021;Mohiuddin et al, 2021). These methods require architectural changes to a model whereas CipherDAug does not.…”
Section: Related Work
confidence: 99%
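The statement above contrasts "hard" replacements (swapping a token without regard to context) with "soft" replacements (mixing candidate tokens weighted by how well each fits the context). The following toy sketch illustrates that distinction; it is not code from any of the cited papers, and the one-hot embeddings and bigram-style scorer are stand-ins for a real language model:

```python
import random

# Toy vocabulary with one-hot embeddings (a stand-in for learned embeddings).
VOCAB = ["the", "cat", "dog", "sat", "ran", "on", "mat"]
EMB = {w: [float(i == j) for j in range(len(VOCAB))] for i, w in enumerate(VOCAB)}


def hard_replace(tokens, pos, rng):
    """Hard replacement: swap tokens[pos] for a uniformly random vocab word,
    ignoring the surrounding context entirely."""
    out = list(tokens)
    out[pos] = rng.choice(VOCAB)
    return out


def soft_replace(tokens, pos, score_fn):
    """Soft replacement: return an *expected* embedding for position `pos`,
    weighting every candidate by a context-conditioned score."""
    weights = [score_fn(tokens, pos, w) for w in VOCAB]
    total = sum(weights)
    probs = [w / total for w in weights]
    dim = len(VOCAB)
    return [sum(p * EMB[w][d] for p, w in zip(probs, VOCAB)) for d in range(dim)]


def toy_score(tokens, pos, candidate):
    # Hypothetical contextual scorer: after "the", nouns are more plausible.
    prev = tokens[pos - 1] if pos > 0 else None
    return 5.0 if prev == "the" and candidate in ("cat", "dog", "mat") else 1.0


rng = random.Random(0)
sent = ["the", "cat", "sat", "on", "the", "mat"]
hard = hard_replace(sent, 1, rng)   # context-free swap at position 1
vec = soft_replace(sent, 1, toy_score)
```

The soft variant never commits to a single discrete token; it feeds a probability-weighted mixture of embeddings downstream, which is why (as the statement notes) it requires architectural changes to the model that consumes it.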
“…In Chapter 3 we leverage language models to generate synthetic labeled sequences for data augmentation in sequence tagging tasks. Data augmentation has also proven useful in cross-lingual settings [68][69][70][71][72][73]. Moreover, most existing methods overlook better utilization of multilingual training data when such resources are available, so in Chapter 3 we explore methods that exploit translation models and multilingual resources for data augmentation.…”
Section: Data Augmentation
confidence: 99%
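The statement above describes generating synthetic labeled sequences for sequence-tagging augmentation. A minimal sketch of that idea, under stated assumptions: only "O"-tagged (non-entity) tokens are replaced so the label sequence stays valid, and a hypothetical substitution table stands in for a real language model's fill-in candidates. None of this is the thesis's actual method:

```python
import random

# Hypothetical context-plausible substitutes (a real system would query an LM).
SUBSTITUTES = {
    "visited": ["toured", "reached"],
    "yesterday": ["recently", "today"],
}


def augment(tokens, tags, rng, p=0.5):
    """Return a new (tokens, tags) pair. Only tokens tagged 'O' may be
    replaced, so entity spans and the tag sequence are preserved."""
    new_tokens = []
    for tok, tag in zip(tokens, tags):
        if tag == "O" and tok in SUBSTITUTES and rng.random() < p:
            new_tokens.append(rng.choice(SUBSTITUTES[tok]))
        else:
            new_tokens.append(tok)
    return new_tokens, list(tags)


rng = random.Random(1)
toks = ["Alice", "visited", "Paris", "yesterday"]
tags = ["B-PER", "O", "B-LOC", "O"]
aug_toks, aug_tags = augment(toks, tags, rng, p=1.0)
```

Because labels are copied unchanged and entity tokens are never touched, each synthetic sentence is a drop-in training example for the tagger.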