Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), 2021
DOI: 10.18653/v1/2021.acl-long.153
Crafting Adversarial Examples for Neural Machine Translation

Abstract: Effective adversary generation for neural machine translation (NMT) is a crucial prerequisite for building robust machine translation systems. In this work, we investigate veritable evaluations of NMT adversarial attacks, and propose a novel method to craft NMT adversarial examples. We first show the current NMT adversarial attacks may be improperly estimated by the commonly used monodirectional translation, and we propose to leverage the round-trip translation technique to build valid metrics for evaluating N…
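As a rough illustration of the round-trip evaluation idea described in the abstract: a purely monodirectional BLEU drop can over-credit an attack, because part of the degradation may come from the perturbation destroying the source meaning rather than from the NMT model mistranslating it; translating the (clean or perturbed) source into the target language and back gives a signal in the source language itself. The sketch below is a minimal illustration of that idea, assuming public Helsinki-NLP Marian models and sacrebleu for scoring; it is not the paper's exact metric or implementation.

# Minimal round-trip translation check (illustrative; the model pair and
# scoring rule are assumptions, not the paper's implementation).
import sacrebleu
from transformers import MarianMTModel, MarianTokenizer

def load(name):
    tok = MarianTokenizer.from_pretrained(name)
    return tok, MarianMTModel.from_pretrained(name)

fwd_tok, fwd = load("Helsinki-NLP/opus-mt-en-de")   # assumed source -> target model
bwd_tok, bwd = load("Helsinki-NLP/opus-mt-de-en")   # assumed target -> source model

def translate(tok, model, text):
    batch = tok([text], return_tensors="pt", padding=True)
    out = model.generate(**batch, max_new_tokens=128)
    return tok.batch_decode(out, skip_special_tokens=True)[0]

def round_trip_bleu(src):
    # Translate src into the target language and back, then score the
    # reconstruction against the source sentence it started from.
    back = translate(bwd_tok, bwd, translate(fwd_tok, fwd, src))
    return sacrebleu.sentence_bleu(back, [src]).score

clean = "The committee approved the new budget yesterday."
adv = "The comittee approved the new budget yesterday."   # hypothetical perturbed source

# A much larger round-trip drop for the perturbed input than for the clean one
# suggests the attack genuinely degrades translation quality.
print(round_trip_bleu(clean), round_trip_bleu(adv))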

Cited by 28 publications (18 citation statements); references 17 publications.
“…Even though the success rate of the concatenation attack lags behind the state-of-the-art textual attack, the manipulation attack achieves performance in the same ballpark, which demonstrates the efficacy of optimization-based attacks and our solvers. More importantly, it implies that the attack is not transferable between the two tasks, documenting more evidence on language attack transferability (Yuan et al., 2021; He et al., 2021). The bottom line is that they are two different tasks under different assumptions.…”
Section: Results
confidence: 91%
“…The previous approaches for constructing NMT adversarial examples can be divided into two branches: white-box and black-box. The white-box approaches are based on the assumption that the architecture and parameters of the NMT model are accessible (Ebrahimi et al., 2018; Cheng et al., 2019; Chen et al., 2021). These methods usually achieve superior performance since they can construct and defend against adversaries tailored to the model.…”
Section: Adversarial Examples for NMT
confidence: 99%
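To make the white-box branch quoted above concrete: with access to the model's parameters, the gradient of the loss with respect to a token's embedding gives a first-order estimate of how much each candidate substitution would increase the loss. The toy example below only sketches that criterion (in the spirit of HotFlip-style attacks); the tiny classifier, vocabulary, and data are illustrative assumptions, not any cited system.

# Sketch of a gradient-based (white-box) word substitution criterion.
import torch
import torch.nn as nn

torch.manual_seed(0)
vocab_size, dim = 50, 16
emb = nn.Embedding(vocab_size, dim)
clf = nn.Linear(dim, 2)                       # stand-in for the victim model head

tokens = torch.tensor([3, 7, 12, 5])          # toy input sentence (token ids)
label = torch.tensor(1)

x = emb(tokens)
x.retain_grad()                               # keep gradients at the embedded input
loss = nn.functional.cross_entropy(clf(x.mean(dim=0, keepdim=True)), label.unsqueeze(0))
loss.backward()

pos = 2                                       # position we try to flip
grad = x.grad[pos]                            # d loss / d embedding at that position
# First-order estimate of the loss increase for swapping in each vocabulary entry:
gain = (emb.weight - x[pos].detach()) @ grad
best = int(gain.argmax())
print(f"replace token {int(tokens[pos])} with {best} "
      f"(estimated loss increase {gain[best].item():.3f})")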
“…Our code is available at https://github.com/yonxie/AdvFinTweet. It is now known that text-based deep learning models can be vulnerable to adversarial attacks (Szegedy et al., 2014; Goodfellow et al., 2015). The perturbation can be at the sentence level (e.g., Xu et al., 2021; Iyyer et al., 2018; Ribeiro et al., 2018), the word level (e.g., Zhang et al., 2019; Alzantot et al., 2018; Zang et al., 2020; Jin et al., 2020; Lei et al., 2019; Zhang et al., 2021; Lin et al., 2021), or both (Chen et al., 2021). We are interested in whether such vulnerability to adversarial attacks also exists in stock prediction models, as these models embrace more and more human-generated media data (e.g., Twitter, Reddit, Stocktwit, Yahoo News) (Xu and Cohen, 2018; Sawhney et al., 2021).…”
Section: Introduction
confidence: 99%
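For the word-level perturbations mentioned above, a common black-box recipe is to substitute one word at a time from a candidate list and keep any variant that changes the victim model's prediction. The snippet below sketches that loop; the synonym table and the victim_predict stub are hypothetical stand-ins (e.g., for a stock-movement classifier), not an existing attack library.

# Sketch of a word-level, query-based perturbation loop (illustrative only).
SYNONYMS = {                      # assumed candidate substitutions
    "rise": ["climb", "increase"],
    "strong": ["robust", "solid"],
}

def victim_predict(text: str) -> int:
    # Stand-in for the target model being attacked.
    return 1 if "strong" in text else 0

def word_level_attacks(sentence: str):
    words = sentence.split()
    original = victim_predict(sentence)
    for i, w in enumerate(words):
        for cand in SYNONYMS.get(w.lower(), []):
            perturbed = " ".join(words[:i] + [cand] + words[i + 1:])
            if victim_predict(perturbed) != original:   # prediction flipped
                yield perturbed

for adv in word_level_attacks("Earnings look strong and shares may rise"):
    print(adv)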