2022
DOI: 10.1007/978-3-031-04083-2_4

General Pitfalls of Model-Agnostic Interpretation Methods for Machine Learning Models

Abstract: An increasing number of model-agnostic interpretation techniques for machine learning (ML) models such as partial dependence plots (PDP), permutation feature importance (PFI) and Shapley values provide insightful model interpretations, but can lead to wrong conclusions if applied incorrectly. We highlight many general pitfalls of ML model interpretation, such as using interpretation techniques in the wrong context, interpreting models that do not generalize well, ignoring feature dependencies, interactions, un…

Cited by 81 publications (73 citation statements)
References 122 publications (176 reference statements)
“…However, appropriate selection of the approaches to use can be challenging and depends on the characteristics of the datasets used for model training and validation. Use of inappropriate approaches, such as applying PDP to a dataset containing intercorrelated features, can generate misleading information that is not easy to distinguish and may result in unintentional harm (68). Unfortunately, there is no guideline or standard guiding the use of these approaches; however, increasing the awareness of these techniques in the oncology community is an important initial step to establishing the interdisciplinary collaboration involving clinical experts, data scientists, and ML engineers that will lead to more robust interpretation.…”
Section: Discussion
confidence: 99%
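A minimal, self-contained sketch of the pitfall described above; the dataset, feature construction, and model choice are illustrative assumptions, not code from the cited works. It shows why a PDP over strongly correlated features can mislead: the marginal averaging forces feature combinations that never occur in the data, so the curve at extreme grid values reflects model extrapolation rather than a trustworthy effect estimate.

```python
# Illustrative sketch (assumed data and model): PDP on correlated features.
import numpy as np
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(0)
n = 2000

# Two strongly correlated features: x2 is almost a copy of x1.
x1 = rng.normal(size=n)
x2 = x1 + 0.05 * rng.normal(size=n)
y = x1 + x2 + 0.1 * rng.normal(size=n)
X = np.column_stack([x1, x2])

model = RandomForestRegressor(random_state=0).fit(X, y)

def partial_dependence_1d(model, X, feature_idx, grid):
    """Brute-force PDP: average prediction with one feature forced to each grid value."""
    pdp = []
    for value in grid:
        X_mod = X.copy()
        # Forcing x1 to a fixed value while keeping x2 creates unrealistic
        # (x1, x2) pairs when the two features are highly correlated.
        X_mod[:, feature_idx] = value
        pdp.append(model.predict(X_mod).mean())
    return np.array(pdp)

grid = np.linspace(X[:, 0].min(), X[:, 0].max(), 20)
pdp_x1 = partial_dependence_1d(model, X, feature_idx=0, grid=grid)

# At the ends of the grid the averages rely on feature combinations such as
# (x1 = 3, x2 = -3) that never appear in the data.
print(np.round(pdp_x1, 2))
```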
“…Model interpretations are not detached from model performance. Misleading information can be a result of interpreting under- or over-fitted models (63,68). Therefore, we suggest prioritizing model generalizability and applying the interpretation approaches to those high-performing models for additional insights.…”
Section: Discussion
confidence: 99%
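As a rough illustration of that recommendation, the sketch below checks generalization before any interpretation method is applied; the dataset, model, and thresholds are arbitrary assumptions, not values taken from the cited works.

```python
# Illustrative sketch (assumed data, model, and thresholds): verify that the
# model generalizes before trusting any post-hoc interpretation of it.
from sklearn.datasets import make_regression
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import train_test_split

X, y = make_regression(n_samples=1000, n_features=10, noise=10.0, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

model = RandomForestRegressor(random_state=0).fit(X_train, y_train)
r2_train, r2_test = model.score(X_train, y_train), model.score(X_test, y_test)

# Example thresholds only: a large train/test gap (over-fitting) or a low test
# score (under-fitting) means the interpretation would describe a flawed model.
if r2_test < 0.5 or (r2_train - r2_test) > 0.2:
    print(f"Poor generalization (train R2={r2_train:.2f}, test R2={r2_test:.2f}); "
          "fix the model before interpreting it.")
else:
    print("Performance looks acceptable; proceed with interpretation on held-out data.")
```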
“…See also a number of previous surveys and critiques of interpretability work that have overlap with ours [3], [58], [60], [68], [95], [118], [136], [173]- [175], [208], [215], [218], [219]. This survey, however, is distinct in its focus on inner interpretability, AI safety, and the intersections between interpretability and several other research paradigms.…”
Section: Scope and Taxonomy
confidence: 98%
“…Feature effects offer insights into the impact of a feature on the model outcome, which are particularly advantageous for transient stability enhancement measure design and thus we deem them more suitable to the application proposed in this paper. Many such post-hoc IML techniques exist (reported in [20]) and authors in [21] highlight many of the pitfalls, urging caution when using IML to avoid drawing incorrect conclusions. Local Interpretable Model-agnostic Explanations (LIME) is a local technique capable of providing feature effects for individual points that can be extrapolated to form global explanations [22].…”
Section: Introduction
confidence: 99%
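A minimal sketch of the local explanation technique named above, using the open-source `lime` package; the dataset, model, and parameter choices are illustrative assumptions rather than the cited authors' setup.

```python
# Illustrative sketch (assumed data and model): a local LIME explanation,
# i.e. feature effects for a single prediction via a weighted linear surrogate.
from lime.lime_tabular import LimeTabularExplainer
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier

data = load_breast_cancer()
model = RandomForestClassifier(random_state=0).fit(data.data, data.target)

explainer = LimeTabularExplainer(
    data.data,
    feature_names=list(data.feature_names),
    class_names=list(data.target_names),
    mode="classification",
)

# Explain one instance: LIME perturbs the point, queries the model, and fits a
# local linear model whose coefficients serve as feature effects.
explanation = explainer.explain_instance(data.data[0], model.predict_proba, num_features=5)
for feature, weight in explanation.as_list():
    print(f"{feature}: {weight:+.3f}")
```

Aggregating such local explanations over many instances is one way to form the global view mentioned in the quoted passage.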
“…Permutation Feature Importance (PFI) is a global technique [24], used in [25] to interpret DT models trained to predict the transient stability limit. PFI can provide feature importance, but not feature effects [21] and is limited in that feature importance is based on the decrease in model performance (i.e., is linked to the error of the model).…”
Section: Introduction
confidence: 99%
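A short sketch of the point made above, using scikit-learn's `permutation_importance` (dataset and model are illustrative assumptions): the PFI score is the decrease in held-out performance after permuting a feature, so it is tied to the model's error and says nothing about the direction or shape of a feature's effect.

```python
# Illustrative sketch (assumed data and model): permutation feature importance
# as the mean drop in test-set score over repeated permutations of each feature.
from sklearn.datasets import make_regression
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.inspection import permutation_importance
from sklearn.model_selection import train_test_split

X, y = make_regression(n_samples=1000, n_features=5, n_informative=3, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

model = GradientBoostingRegressor(random_state=0).fit(X_train, y_train)

result = permutation_importance(model, X_test, y_test, n_repeats=10, random_state=0)
for i, (mean, std) in enumerate(zip(result.importances_mean, result.importances_std)):
    # A feature the model never uses gets ~0 importance even if it matters in reality.
    print(f"feature {i}: score drop = {mean:.3f} +/- {std:.3f}")
```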