We propose an efficient method to generate white-box adversarial examples that trick a character-level neural classifier. We find that only a few manipulations are needed to greatly decrease the accuracy. Our method relies on an atomic flip operation, which swaps one token for another based on the gradients of the one-hot input vectors. Owing to the efficiency of our method, we can perform adversarial training, which makes the model more robust to attacks at test time. With a few semantics-preserving constraints, we demonstrate that HotFlip can be adapted to attack a word-level classifier as well.
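The atomic flip can be sketched as a first-order estimate: for a one-hot input, the loss change from swapping the character at position i from a to b is approximated by the difference of the corresponding gradient entries. The following is a minimal sketch under that assumption; the function name and the toy gradient values are ours, not from the paper.

```python
import numpy as np

def best_flip(one_hot, grad):
    """Return (position, new_char_id, estimated_loss_increase) for the single
    flip that most increases the loss, to first order: dL ~ g[i,b] - g[i,a]."""
    # gradient w.r.t. the character currently at each position
    current = (grad * one_hot).sum(axis=1, keepdims=True)  # shape (seq_len, 1)
    gain = grad - current                                  # estimate for every flip
    gain[one_hot.astype(bool)] = -np.inf                   # exclude no-op "flips"
    i, b = np.unravel_index(np.argmax(gain), gain.shape)
    return int(i), int(b), float(gain[i, b])

# toy example: a 3-character input over a vocabulary of 4 characters
x = np.eye(4)[[0, 2, 1]]                  # one-hot input
g = np.array([[0.1, 0.5, -0.2, 0.0],
              [0.0, 0.3,  0.4, 0.9],
              [0.2, 0.1,  0.6, 0.1]])     # hypothetical dL/dx
pos, char, gain = best_flip(x, g)         # pos=1, char=3, gain=0.5
```

In the full method this estimate ranks all candidate flips with a single gradient computation, which is what makes adversarial training affordable.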
Related Work

Adversarial examples are powerful tools to investigate the vulnerabilities of a deep learning model (Szegedy et al., 2014). While this line of research has recently received a lot of attention in
We present a novel approach for relation classification, using a recursive neural network (RNN), based on the shortest path between two entities in a dependency graph. Previous work on RNNs has been based on constituency-based parsing, because phrasal nodes in a parse tree can capture compositionality in a sentence. Compared with constituency-based parse trees, dependency graphs can represent relations more compactly. This is particularly important in sentences with distant entities, where the parse tree spans words that are not relevant to the relation. In such cases, the RNN cannot be trained effectively in a timely manner. However, due to the lack of phrasal nodes in dependency graphs, applying RNNs is not straightforward. To tackle this problem, we utilize dependency constituent units called chains. Our experiments on two relation classification datasets show that the chain-based RNN provides a shallower network, which trains considerably faster and achieves better classification results.
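Extracting the shortest path between two entities in a dependency graph amounts to a breadth-first search over the dependency arcs treated as undirected edges. A minimal sketch, using our own toy sentence rather than the paper's data:

```python
from collections import deque

def shortest_dep_path(edges, start, goal):
    """BFS over an undirected dependency graph given as (head, dependent) pairs."""
    adj = {}
    for h, d in edges:
        adj.setdefault(h, []).append(d)
        adj.setdefault(d, []).append(h)
    queue, prev = deque([start]), {start: None}
    while queue:
        node = queue.popleft()
        if node == goal:
            path = []                      # reconstruct by walking predecessors
            while node is not None:
                path.append(node)
                node = prev[node]
            return path[::-1]
        for nxt in adj.get(node, []):
            if nxt not in prev:
                prev[nxt] = node
                queue.append(nxt)
    return None

# "A fire broke out in the kitchen": path between the entities "fire" and "kitchen"
edges = [("broke", "fire"), ("broke", "out"), ("broke", "in"),
         ("in", "kitchen"), ("kitchen", "the"), ("fire", "A")]
path = shortest_dep_path(edges, "fire", "kitchen")
# -> ["fire", "broke", "in", "kitchen"]
```

The path skips "A", "out", and "the", illustrating why the dependency path is a compact input for the RNN compared with a full constituency subtree.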
Evaluating on adversarial examples has become a standard procedure to measure the robustness of deep learning models. Due to the difficulty of creating white-box adversarial examples for discrete text input, most analyses of the robustness of NLP models have been done through black-box adversarial examples. We investigate adversarial examples for character-level neural machine translation (NMT), and contrast black-box adversaries with a novel white-box adversary, which employs differentiable string-edit operations to rank adversarial changes. We propose two novel types of attacks which aim to remove or change a word in a translation, rather than simply breaking the NMT system. We demonstrate that white-box adversarial examples are significantly stronger than their black-box counterparts in different attack scenarios, revealing more serious vulnerabilities than previously known. In addition, after performing adversarial training, which takes only 3 times longer than regular training, we can improve the model's robustness significantly.
Supervised stance classification, in such domains as Congressional debates and online forums, has been a topic of interest in the past decade. Approaches have evolved from text classification to structured output prediction, including collective classification and sequence labeling. In this work, we investigate collective classification of stances on Twitter, using hinge-loss Markov random fields (HL-MRFs). Given the graph of all posts, users, and their relationships, we constrain the predicted post labels and latent user labels to correspond with the network structure. We focus on a weakly supervised setting, in which only a small set of hashtags or phrases is labeled. Using our relational approach, we are able to go beyond the stance-indicative patterns and harvest more stance-indicative tweets, which can also be used to train any linear text classifier when the network structure is not available or is costly.
Modeling physical activity propagation, such as physical exercise level and intensity, is key to preventing behavior that can lead to obesity; it can also help spread wellness behavior in a social network.
We focus on the recognition of Dyck-n (D_n) languages with self-attention (SA) networks, which has been deemed a difficult task for these networks. We compare the performance of two variants of SA, one with a starting symbol (SA+) and one without (SA-). Our results show that SA+ is able to generalize to longer sequences and deeper dependencies. For D_2, we find that SA- completely breaks down on long sequences, whereas the accuracy of SA+ is 58.82%. We find the attention maps learned by SA+ to be amenable to interpretation and compatible with a stack-based language recognizer. Surprisingly, the performance of SA networks is on par with that of LSTMs, which provides evidence of the ability of SA to learn hierarchies without recursion.
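The stack-based recognizer that the attention maps are compared against can be written in a few lines. A minimal sketch for D_2 over the bracket pairs "()" and "[]" (the function name is ours):

```python
def is_dyck2(s):
    """Accept a string iff it is a well-nested Dyck-2 word over '()' and '[]'."""
    pairs = {")": "(", "]": "["}
    stack = []
    for ch in s:
        if ch in "([":
            stack.append(ch)               # open bracket: push
        elif ch in pairs:
            if not stack or stack.pop() != pairs[ch]:
                return False               # close bracket must match the top
        else:
            return False                   # symbol outside the alphabet
    return not stack                       # accept only if nothing is left open

is_dyck2("([])[]")   # -> True
is_dyck2("([)]")     # -> False (crossed brackets)
```

Each open bracket must wait on the stack until its matching close arrives, which is exactly the long-distance, hierarchical dependency that makes D_n a probe for whether SA can emulate a stack.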