2021 · DOI: 10.1609/aaai.v35i16.17685
Building Interpretable Interaction Trees for Deep NLP Models

Abstract: This paper proposes a method to disentangle and quantify interactions among words that are encoded inside a DNN for natural language processing. We construct a tree to encode salient interactions extracted by the DNN. Six metrics are proposed to analyze properties of interactions between constituents in a sentence. The interaction is defined based on Shapley values of words, which are considered an unbiased estimation of word contributions to the network prediction. Our method is used to quantify word interactions…
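The abstract describes the core pipeline: estimate Shapley values of words as attribution scores, define the interaction between adjacent constituents as the gain from merging them into a single player, and grow a tree by greedily merging the strongest-interacting pair. Below is a minimal, self-contained sketch of that idea; it is not the authors' implementation, the paper's exact interaction definition differs in its details, and all names (`toy_score`, `shapley`, `build_interaction_tree`) plus the toy scoring function are hypothetical:

```python
# Minimal sketch of Shapley-value-based interaction-tree construction.
# Not the authors' implementation: toy_score stands in for a real DNN,
# and the greedy merge rule only illustrates the general idea.
import random

def toy_score(active_words, sentence):
    """Stand-in for a DNN prediction v(S) on a subset of word indices:
    an additive score plus a synergy bonus so interactions are non-trivial."""
    s = sum(len(sentence[i]) for i in active_words)
    if {1, 2} <= set(active_words):  # hypothetical synergy between words 1 and 2
        s += 5.0
    return s

def shapley(players, sentence, n_samples=200, seed=0):
    """Monte-Carlo Shapley values. Each player is a tuple of word indices,
    so a merged constituent is treated as a single player."""
    rng = random.Random(seed)
    phi = {p: 0.0 for p in players}
    for _ in range(n_samples):
        order = list(players)
        rng.shuffle(order)
        active, prev = [], toy_score([], sentence)
        for p in order:
            active += list(p)
            cur = toy_score(active, sentence)
            phi[p] += cur - prev
            prev = cur
    return {p: v / n_samples for p, v in phi.items()}

def build_interaction_tree(sentence):
    """Greedily merge the adjacent pair of constituents with the largest
    interaction B(a, b) = phi(a+b) - phi(a) - phi(b)."""
    nodes = [(i,) for i in range(len(sentence))]  # leaves: single words
    tree = list(nodes)
    while len(nodes) > 1:
        phi_sep = shapley(nodes, sentence)
        best, best_gain = 0, float("-inf")
        for k in range(len(nodes) - 1):
            a, b = nodes[k], nodes[k + 1]
            merged = a + b
            phi_mrg = shapley(nodes[:k] + [merged] + nodes[k + 2:], sentence)
            gain = phi_mrg[merged] - phi_sep[a] - phi_sep[b]
            if gain > best_gain:
                best, best_gain = k, gain
        nodes[best:best + 2] = [nodes[best] + nodes[best + 1]]
        tree.append(nodes[best])
    return tree  # merge order, e.g. [(0,), (1,), (2,), (3,), (1, 2), ...]

if __name__ == "__main__":
    print(build_interaction_tree(["not", "very", "good", "movie"]))
```

With the synergy planted between words 1 and 2, the first merge pairs "very" and "good", mirroring how the paper's trees surface salient constituents.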

Cited by 11 publications (2 citation statements) · References 40 publications
“…Additionally, using hierarchical representations to analyze DNNs has become an area of interest in the NLP community. Zhang et al. [93] constructed a tree to encode salient interactions extracted by the DNN, on the basis of Shapley values of words [50]. Considering that displaying the original attribution matrix for long text causes visual clutter, and that most elements of the attribution matrix are minimal (close to zero), we adopt the tree-generation algorithm proposed by Hao et al. [25] to display the information flow inside the module (R4).…”
Section: Exploring Layer-level Information Flow
confidence: 99%
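The excerpt's design rationale (attribution matrices for long texts are mostly near zero, so showing them raw causes clutter) can be illustrated with a simple sparsification pass before visualization. This is a hypothetical preprocessing sketch, not the tree-generation algorithm of Hao et al. [25]; `prune_attributions` and its threshold rule are invented for illustration:

```python
import numpy as np

def prune_attributions(attr, keep_fraction=0.1):
    """Zero out all but the largest-magnitude entries of an attribution
    matrix, keeping roughly the top `keep_fraction` of values.
    Hypothetical step, not the algorithm of Hao et al. [25]."""
    flat = np.abs(attr).ravel()
    k = max(1, int(keep_fraction * flat.size))
    threshold = np.partition(flat, -k)[-k]  # k-th largest magnitude
    return np.where(np.abs(attr) >= threshold, attr, 0.0)

# Example: a 6x6 token-to-token attribution matrix, mostly near zero.
rng = np.random.default_rng(0)
attr = rng.normal(scale=0.01, size=(6, 6))
attr[1, 2] = 0.8  # one salient interaction survives pruning
print(prune_attributions(attr, keep_fraction=0.05))
```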
“…Sikdar et al. (2021) compute importance scores in a bottom-up manner, starting from individual embedding dimensions and working up through tokens, words, and phrases to the full sentence. Zhang et al. (2021) build interpretable interaction trees, where the interaction is again defined based on Shapley values. While these methods produce spans of tokens that are part of an interaction, the hierarchical nature of the explanation limits interactions to neighboring spans only.…”
Section: Related Work
confidence: 99%
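Both excerpts refer to interactions defined via Shapley values. For orientation, a common way to formalize the interaction between two adjacent constituents a and b treats each constituent as a single player and compares the merged coalition's Shapley value against the sum of the parts. This is a hedged paraphrase, not the paper's exact notation, which involves further partition details:

```latex
% Sketch of a Shapley-interaction score between adjacent constituents a, b.
% \phi(\cdot) is the Shapley value of a player; [a,b] treats the merged
% span as one player. (Hedged paraphrase, not the paper's exact notation.)
\[
  B([a,b]) \;=\; \phi([a,b]) \;-\; \phi(a) \;-\; \phi(b)
\]
% B > 0: the constituents cooperate; B < 0: adversarial interaction;
% B \approx 0: negligible interaction (a candidate for not merging).
```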