2022
DOI: 10.1016/j.isci.2022.104176
|View full text |Cite
|
Sign up to set email alerts
|

Multilingual multi-aspect explainability analyses on machine reading comprehension models

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
17
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
4
1
1

Relationship

3
3

Authors

Journals

citations
Cited by 10 publications
(17 citation statements)
references
References 24 publications
0
17
0
Order By: Relevance
“…We can see that the exact match (EM) score is 80.567, and the F1 score is 88.117.
{"exact": 80.56764427625355, "f1": 88.11721947565059, "total": 10570, "HasAns_exact": 80.56764427625355, "HasAns_f1": 88.11721947565059, "HasAns_total": 10570}
Note: As the main goal of our previous work ( Cui et al., 2022 ) was to provide robust and comprehensive analyses of machine reading comprehension models, we carried out each experiment five times with different random seeds, and their average scores were used. However, to minimize the training time, we only train one model in this protocol, and it can be easily generalized to multiple runs as well by running steps 1–3 multiple times.…”
Section: Step-by-step Methods Detailsmentioning
confidence: 99%
See 2 more Smart Citations
“…We can see that the exact match (EM) score is 80.567, and the F1 score is 88.117.
{"exact": 80.56764427625355, "f1": 88.11721947565059, "total": 10570, "HasAns_exact": 80.56764427625355, "HasAns_f1": 88.11721947565059, "HasAns_total": 10570}
Note: As the main goal of our previous work ( Cui et al., 2022 ) was to provide robust and comprehensive analyses of machine reading comprehension models, we carried out each experiment five times with different random seeds, and their average scores were used. However, to minimize the training time, we only train one model in this protocol, and it can be easily generalized to multiple runs as well by running steps 1–3 multiple times.…”
Section: Step-by-step Methods Detailsmentioning
confidence: 99%
“…In this context, we propose to visualize the attention by using a multilingual and multi-aspect way to comprehensively understand whether these attentions can be explainable ( Cui et al., 2022 ). Instead of analyzing the attention matrix as a whole, we decompose the attention matrix into four different attention zones to explicitly analyze their behaviors.…”
Section: Before You Beginmentioning
confidence: 99%
See 1 more Smart Citation
“…For the explainability studies in MRC, [26] propose a method to extract evidence sentences from multi-choice MRC tasks. [27] propose to use system performance rather than visualizing attention score to better reveal the model's explainability. [28] investigate a few black-box attacks at the character, word, and sentence level for MRC systems.…”
Section: Related Workmentioning
confidence: 99%
“…Transformer Pruning Previous studies (Michel et al, 2019;Voita et al, 2019) have shown that not all attention heads are equally important in the transformers, and some of the attention heads can be pruned without performance loss (Cui et al, 2022). Thus, Identifying and removing the least important attention heads can reduce the model size and have a small impact on performance.…”
Section: Pruning Modementioning
confidence: 99%