Assessing the Trustworthiness of Saliency Maps for Localizing Abnormalities in Medical Imaging

Arun, Nishanth; Gaw, Nathan; Singh, Praveer; Chang, Ken; Aggarwal, Mehak; Chen, Bryan; Hoebel, Katharina; Gupta, Sharut; Patel, Jay; Gidwani, Mishka; Adebayo, Julius; Li, Matthew; Kalpathy–Cramer, Jayashree

doi:10.1148/ryai.2021200267

Cited by 132 publications

(83 citation statements)

References 29 publications


“…As saliency maps have been shown to be not reliable in some cases, it is important to ensure their robustness against the model weights, label randomization, as well as their repeatability and localization relevance (Arun et al, 2021). Therefore, sanity checks were conducted, following Adebayo et al (2018).…”

Section: Methods (mentioning)

confidence: 99%

Multimodal biological brain age prediction using magnetic resonance imaging and angiography with the identification of predictive regions

Mouchès, Wilms, Rajashekar et al. 2022, Human Brain Mapping


Biological brain age predicted using machine learning models based on high‐resolution imaging data has been suggested as a potential biomarker for neurological and cerebrovascular diseases. In this work, we aimed to develop deep learning models to predict the biological brain age using structural magnetic resonance imaging and angiography datasets from a large database of 2074 adults (21–81 years). Since different imaging modalities can provide complementary information, combining them might allow to identify more complex aging patterns, with angiography data, for instance, showing vascular aging effects complementary to the atrophic brain tissue changes seen in T1‐weighted MRI sequences. We used saliency maps to investigate the contribution of cortical, subcortical, and arterial structures to the prediction. Our results show that combining T1‐weighted and angiography MR data led to a significantly improved brain age prediction accuracy, with a mean absolute error of 3.85 years comparing the predicted and chronological age. The most predictive brain regions included the lateral sulcus, the fourth ventricle, and the amygdala, while the brain arteries contributing the most to the prediction included the basilar artery, the middle cerebral artery M2 segments, and the left posterior cerebral artery. Our study proposes a framework for brain age prediction using multimodal imaging, which gives accurate predictions and allows identifying the most predictive regions for this task, which can serve as a surrogate for the brain regions that are most affected by aging.


“…However, the interpretation of these results warrants additional scrutiny because recent studies emphasized that many popular saliency maps used to interpret CNN trained on medical imaging did not meet several key criteria for utility and robustness, highlighting the need for additional validation before clinical application. 45–47 For the alternative technique, a computer-aided diagnosis system that utilizes the complementary information from CNN-based and feature-based methods will need to be further developed. Also, qualitative analysis of the latest techniques to better obtain the activation map will be required.…”

Section: Discussion (mentioning)

confidence: 99%

“…Also, qualitative analysis of the latest techniques to better obtain the activation map will be required. 45 …”

Section: Discussion (mentioning)

confidence: 99%

A Deep Learning Algorithm for Classifying Diabetic Retinopathy Using Optical Coherence Tomography Angiography

Ryu, Lee, Park et al. 2022, Trans. Vis. Sci. Tech.


Purpose: To develop an automated diabetic retinopathy (DR) staging system using optical coherence tomography angiography (OCTA) images with a convolutional neural network (CNN) and to verify the feasibility of the system. Methods: In this retrospective cross-sectional study, a total of 918 data sets of 3 × 3 mm² OCTA images and 917 data sets of 6 × 6 mm² OCTA images were obtained from 1118 eyes. A deep CNN and four traditional machine learning models were trained with annotations made by a retinal specialist based on ultra-widefield fluorescein angiography. Separately, the same images of the test data sets were independently graded by two human experts. The results of the CNN algorithm were compared with those of traditional machine learning–based classifiers and human experts. Results: The proposed CNN achieved an accuracy of 0.728, a sensitivity of 0.675, a specificity of 0.944, an F1 score of 0.683, and a quadratic weighted κ of 0.908 for a six-level staging task, which were far superior to the results of traditional machine learning methods or human experts. The CNN algorithm showed a better performance using 6 × 6 mm² rather than 3 × 3 mm² sized OCTA images and using combined data rather than a separate OCTA layer alone. Conclusions: CNN-based classification using OCTA images can provide reliable assistance to clinicians for DR classification. Translational Relevance: This CNN algorithm can guide the clinical decision for invasive angiography or referrals to ophthalmology specialists, helping to create more efficient diagnostic workflow in primary care settings.


“…Although existing works on XAI evaluation proposed many real-world application desiderata and evaluation metrics [65,49,71,21,32,2,27,20,24], there is not a canonical criterion on the goodness of explanation, and it is unknown which evaluation objectives are suitable for clinical applications. For the very limited emerging XAI evaluation works on medical image tasks, such as on retinal [63], endoscopic [19], and chest X-Ray [5] imaging tasks, the evaluation mainly focused on one criterion, which is how well the explanation agrees with clinical prior knowledge, without justification for the selection of such criterion and its clinical applicability. This evaluation criterion may be confounded by factors outside XAI methods themselves, such as model training and spurious patterns in the data, as detailed in §2.2.…”

Section: Introduction (mentioning)

confidence: 99%

Guidelines and evaluation for clinical explainable AI on medical image analysis

Jin, Li, Fatehi et al. 2022, Preprint


Explainable artificial intelligence (XAI) is essential for enabling clinical users to get informed decision support from AI and comply with evidence-based medical practice. Applying XAI in clinical settings requires proper evaluation criteria to ensure the explanation technique is both technically sound and clinically useful, but specific support is lacking to achieve this goal. To bridge the research gap, we propose the Clinical XAI Guidelines that consist of five criteria a clinical XAI needs to be optimized for. The guidelines recommend choosing an explanation form based on Guideline 1 (G1) Understandability and G2 Clinical relevance. For the chosen explanation form, its specific XAI technique should be optimized for G3 Truthfulness, G4 Informative plausibility, and G5 Computational efficiency. Following the guidelines, we conducted a systematic evaluation on a novel problem of multi-modal medical image explanation with two clinical tasks, and proposed new evaluation metrics accordingly. The evaluated 16 commonly-used heatmap XAI techniques were not suitable for clinical use due to their failure in G3 and G4. Our evaluation demonstrated the use of Clinical XAI Guidelines to support the design and evaluation for clinically viable XAI.
