Proceedings of the First Workshop on Interactive Learning for Natural Language Processing 2021
DOI: 10.18653/v1/2021.internlp-1.1
HILDIF: Interactive Debugging of NLI Models Using Influence Functions

Abstract: Biases and artifacts in training data can cause unwelcome behavior in text classifiers (such as shallow pattern matching), leading to lack of generalizability. One solution to this problem is to include users in the loop and leverage their feedback to improve models. We propose a novel explanatory debugging pipeline called HILDIF, enabling humans to improve deep text classifiers using influence functions as an explanation method. We experiment on the Natural Language Inference (NLI) task, showing that HILDIF c…
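The pipeline's explanation method, influence functions (Koh and Liang, 2017), scores how much each training example contributes to a test prediction's loss. A minimal sketch of that scoring, assuming a simple logistic-regression model and crudely approximating the inverse Hessian by the identity (the full method instead uses implicit Hessian-vector products; all names here are illustrative, not from the paper):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def grad_loss(w, x, y):
    # Gradient of the logistic loss for one example (label y in {0, 1}).
    return (sigmoid(w @ x) - y) * x

def influence(w, x_train, y_train, x_test, y_test):
    # First-order influence score: -grad_test . H^{-1} grad_train.
    # Assumption: the Hessian H is approximated by the identity, so the
    # score reduces to a negated gradient dot product. Negative means
    # upweighting the training point would lower the test loss (helpful);
    # positive means it would raise it (harmful).
    return -grad_loss(w, x_test, y_test) @ grad_loss(w, x_train, y_train)

# Toy weights and a shared feature vector for train and test points.
w = np.array([1.0, -1.0])
x = np.array([1.0, 0.0])
score_same = influence(w, x, 1, x, 1)  # labels agree  -> helpful (< 0)
score_flip = influence(w, x, 0, x, 1)  # labels clash  -> harmful (> 0)
```

In HILDIF, scores like these surface the most influential training examples to the user, who then provides relevance feedback on them.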

Cited by 14 publications (20 citation statements); References 15 publications
“…Most papers in Table 1 focus on text classification with single input (TC) for a variety of specific problems such as email categorization, topic classification (Kulesza et al., 2015; Teso and Kersting, 2019), spam classification (Koh and Liang, 2017), sentiment analysis (Ribeiro et al., 2018b), and auto-coding of transcripts (Kulesza et al., 2010). By contrast, Zylberajch et al. (2021) worked on natural language inference, a text-pair classification task. Ghai et al. (2021) suggested that most researchers work on TC because, for this task, it is much easier for lay participants to understand explanations and give feedback (e.g., which keywords should be added to or removed from the list of top features). Meanwhile, some other NLP tasks, such as part-of-speech tagging, parsing, and machine translation, require the feedback providers to have linguistic knowledge.…”
Section: Tasks (mentioning; confidence: 99%)
“…Some papers even allowed humans to adjust the word importance scores (WS) (Kulesza et al., 2009, 2015). This is analogous to specifying relevance scores for example-based explanations (ES) in Zylberajch et al. (2021). Meanwhile, feedback at the level of learned features (FE) (i.e., the internal neurons in the model) and learned rules (RU), rather than individual words, was asked for in Lertvittayakumjorn et al. (2020) and Ribeiro et al. (2018b), respectively.…”
Section: Collecting Feedback (mentioning; confidence: 99%)