Searching for a Search Method: Benchmarking Search Algorithms for Generating NLP Adversarial Examples

Yoo, Jin Yong; Morris, John X.; Lifland, Eli; Qi, Yanjun

doi:10.48550/arxiv.2009.06368

Cited by 8 publications

(23 citation statements)

References 22 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Yoo et al [15] presented review of search algorithms for generating adversarial examples as a means to achieve robustness. However, their work is limited in scope in that it only focuses on adversarial examples as a means to seek robustness.…”

Section: Related Workmentioning

confidence: 99%

“…3) Sparse Projected Gradient Descent: The projected gradient descent method is a type of greedy algorithm which has been applied broadly to machine learning models [36]. In this method, each element in the input text is considered for substitution and the best perturbations are selected from all possible perturbations and rerun until no more perturbations are possible [15]. This attack method has been utilized in several research works [15], [43], [90] with various promising results.…”

Section: Breaching Security By Improving Attacksmentioning

confidence: 99%

“…In this method, each element in the input text is considered for substitution and the best perturbations are selected from all possible perturbations and rerun until no more perturbations are possible [15]. This attack method has been utilized in several research works [15], [43], [90] with various promising results. For example, Barham et al [43] introduced a sparse projected gradient descent (SPGD) method for crafting interpretable AEs for text applications.…”

Section: Breaching Security By Improving Attacksmentioning

confidence: 99%

“…In this method, each member of the population is perturbed by creating all potential candidate obtained by replacing each input and then sampling one input example, at each iteration. Using this algorithm, we are able to find the best perturbed input among all members of the population [15]. This attack technique has been used in multiple studies addressing machine learning robustness to adversarial attacks in general, including the studies in [12], [15], [53], [96], [97].…”

Section: Breaching Security By Improving Attacksmentioning

confidence: 99%

“…Broadly speaking, such efforts in the literature are either focused on developing new attacks or better training models to make models resistant to such attacks (i.e., defenses) [13]. To sum up the research efforts dedicated understanding robustness in the literature, there are several research surveys that have addressed specific aspects of NLP robustness, e.g., data augmentation [14], search methods [15], pretrained models [16], and adversarial attacks [17]. However, the literature lacks research studies that provide a systematic overview of the state-of-the-art in this space across a range of variables; applications, technique, metrics, benchmark datasets, threat models, tasks, embedding techniques, learning techniques, goals, defense mechanisms, and performance.…”

mentioning

confidence: 99%

See 4 more Smart Citations

Robust Natural Language Processing: Recent Advances, Challenges, and Future Directions

Omar¹,

Choi²,

Nyang³

et al. 2022

Preprint

View full text Add to dashboard Cite

Recent natural language processing (NLP) techniques have accomplished high performance on benchmark datasets, primarily due to the significant improvement in the performance of deep learning. The advances in the research community have led to great enhancements in state-of-the-art production systems for NLP tasks, such as virtual assistants, speech recognition, and sentiment analysis. However, such NLP systems still often fail when tested with adversarial attacks. The initial lack of robustness exposed troubling gaps in current models' language understanding capabilities, creating problems when NLP systems are deployed in real life. In this paper, we present a structured overview of NLP robustness research by summarizing the literature in a systemic way across various dimensions. We then take a deep-dive into the various dimensions of robustness, across techniques, metrics, embeddings, and benchmarks. Finally, we argue that robustness should be multi-dimensional, provide insights into current research, identify gaps in the literature to suggest directions worth pursuing to address these gaps.

show abstract

Section: Related Workmentioning

confidence: 99%

Section: Breaching Security By Improving Attacksmentioning

confidence: 99%

Section: Breaching Security By Improving Attacksmentioning

confidence: 99%

Section: Breaching Security By Improving Attacksmentioning

confidence: 99%

mentioning

confidence: 99%

See 3 more Smart Citations

Robust Natural Language Processing: Recent Advances, Challenges, and Future Directions

Omar¹,

Choi²,

Nyang³

et al. 2022

Preprint

View full text Add to dashboard Cite

show abstract

Robust Natural Language Processing: Recent Advances, Challenges, and Future Directions

et al. 2022

View full text Add to dashboard Cite

Recent natural language processing (NLP) techniques have accomplished high performance on benchmark data sets, primarily due to the significant improvement in the performance of deep learning. The advances in the research community have led to great enhancements in state-of-the-art production systems for NLP tasks, such as virtual assistants, speech recognition, and sentiment analysis. However, such NLP systems still often fail when tested with adversarial attacks. The initial lack of robustness exposed troubling gaps in current models' language understanding capabilities, creating problems when NLP systems are deployed in real life. In this paper, we present a structured overview of NLP robustness research by summarizing the literature in a systemic way across various dimensions. We then take a deepdive into the various dimensions of robustness, across techniques, metrics, embedding, and benchmarks. Finally, we argue that robustness should be multi-dimensional, provide insights into current research, identify gaps in the literature to suggest directions worth pursuing to address these gaps INDEX TERMS Natural Language Processing; Adversarial Attacks; Robustness.

show abstract

Searching for an Effective Defender: Benchmarking Defense against Adversarial Word Substitution

Xu²,

Zeng

et al. 2021

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing

View full text Add to dashboard Cite

Recent studies have shown that deep neural network-based models are vulnerable to intentionally crafted adversarial examples, and various methods have been proposed to defend against adversarial word-substitution attacks for neural NLP models. However, there is a lack of systematic study on comparing different defense approaches under the same attacking setting. In this paper, we seek to fill the gap through comprehensive studies on the behavior of neural text classifiers trained with various defense methods against representative adversarial attacks. In addition, we propose an effective method to further improve the robustness of neural text classifiers against such attacks, and achieved the highest accuracy on both clean and adversarial examples on AGNEWS and IMDB datasets, outperforming existing methods by a significant margin. We hope this study could provide useful clues for future research on text adversarial defense. Codes are available at https:// github.com/RockyLzy/TextDefender.

show abstract

Searching for a Search Method: Benchmarking Search Algorithms for Generating NLP Adversarial Examples

Cited by 8 publications

References 22 publications

Robust Natural Language Processing: Recent Advances, Challenges, and Future Directions

Robust Natural Language Processing: Recent Advances, Challenges, and Future Directions

Robust Natural Language Processing: Recent Advances, Challenges, and Future Directions

Searching for an Effective Defender: Benchmarking Defense against Adversarial Word Substitution

Contact Info

Product

Resources

About