2021
DOI: 10.48550/arxiv.2104.07284
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Consistency Training with Virtual Adversarial Discrete Perturbation

Abstract: We propose an effective consistency training framework that enforces a training model's predictions given original and perturbed inputs to be similar by adding a discrete noise that would incur the highest divergence between predictions. This virtual adversarial discrete noise obtained by replacing a small portion of tokens while keeping original semantics as much as possible efficiently pushes a training model's decision boundary. Moreover, we perform an iterative refinement process to alleviate the degraded … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 33 publications
0
1
0
Order By: Relevance
“…However, the consistency training framework is also applicable when only the labeled samples are available (Miyato et al, 2018;Jiang et al, 2019;Asai and Hajishirzi, 2020). The consistency regularization requires adding noise to the sample, which can be either discrete (Xie et al, 2020;Asai and Hajishirzi, 2020;Park et al, 2021) or continuous (Miyato et al, 2016;Jiang et al, 2019). Existing works regularize the predictions of the perturbed samples to be equivalent to be that of the originals'.…”
Section: Related Workmentioning
confidence: 99%
“…However, the consistency training framework is also applicable when only the labeled samples are available (Miyato et al, 2018;Jiang et al, 2019;Asai and Hajishirzi, 2020). The consistency regularization requires adding noise to the sample, which can be either discrete (Xie et al, 2020;Asai and Hajishirzi, 2020;Park et al, 2021) or continuous (Miyato et al, 2016;Jiang et al, 2019). Existing works regularize the predictions of the perturbed samples to be equivalent to be that of the originals'.…”
Section: Related Workmentioning
confidence: 99%