Sanchit Sinha scite author profile

Sanchit Sinha

4Publications

10Citation Statements Received

16Citation Statements Given

How they've been cited

How they cite others

Affiliations

University of Virginia, Indraprastha Institute of Information Technology Delhi, Indian Institute of Technology Delhi

Publications

Order By: Most citations

Perturbing Inputs for Fragile Interpretations in Deep Natural Language Processing

Sinha¹,

Chen²,

Sekhon³

et al. 2021

View full text Add to dashboard Cite

Interpretability methods like INTEGRATED GRADIENT and LIME are popular choices for explaining natural language model predictions with relative word importance scores. These interpretations need to be robust for trustworthy NLP applications in high-stake areas like medicine or finance. Our paper demonstrates how interpretations can be manipulated by making simple word perturbations on an input text. Via a small portion of word-level swaps, these adversarial perturbations aim to make the resulting text semantically and spatially similar to its seed input (therefore sharing similar interpretations).Simultaneously, the generated examples achieve the same prediction label as the seed yet are given a substantially different explanation by the interpretation methods. Our experiments generate fragile interpretations to attack two SOTA interpretation methods, across three popular Transformer models and on three different NLP datasets. We observe that the rank order correlation drops by over 20% when less than 10% of words are perturbed on average. Further, rank-order correlation keeps decreasing as more words get perturbed. Furthermore, we demonstrate that candidates generated from our method have good quality metrics. Our code is available at: github.com/QData/ TextAttack-Fragile-Interpretations.

show abstract

Exploring Bias in Primate Face Detection and Recognition

Sinha

Agarwal

Vatsa

et al. 2019

View full text Add to dashboard Cite

Video Summarization using Global Attention with Memory Network and LSTM

Sahrawat

Agarwal

Sinha

et al. 2019

View full text Add to dashboard Cite

Triplet Transform Learning for Automated Primate Face Recognition

Agarwal

Sinha

Singh

et al. 2019

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Sanchit Sinha

Perturbing Inputs for Fragile Interpretations in Deep Natural Language Processing

Exploring Bias in Primate Face Detection and Recognition

Video Summarization using Global Attention with Memory Network and LSTM

Triplet Transform Learning for Automated Primate Face Recognition

Contact Info

Product

Resources

About