2020
DOI: 10.1016/j.patcog.2020.107198
Towards explaining anomalies: A deep Taylor decomposition of one-class models

Abstract: A common machine learning task is to discriminate between normal and anomalous data points. In practice, it is not always sufficient to reach high accuracy at this task; one would also like to understand why a given data point has been predicted in a certain way. We present a new principled approach for one-class SVMs that decomposes outlier predictions in terms of input variables. The method first recomposes the one-class model as a neural network with distance functions and min-pooling, and then performs a d…
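The abstract's "distance functions and min-pooling" view can be made concrete with a small sketch. The support vectors and test point below are invented for illustration, and a hard min is used in place of the soft pooling a trained model would use:

```python
import numpy as np

# Hypothetical support vectors of a learned one-class model, and a test point
support_vectors = np.array([[0.0, 0.0],
                            [1.0, 1.0],
                            [2.0, 0.5]])
x = np.array([0.2, 0.1])

# Layer 1: squared Euclidean distances to each support vector
d = np.sum((support_vectors - x) ** 2, axis=1)

# Layer 2: min-pooling -- the outlier score is the distance to the
# nearest support vector (large distance = more anomalous)
outlier_score = d.min()
```

In the paper, deep Taylor decomposition is then applied to such a network to redistribute the outlier score onto the input variables.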

Cited by 74 publications (61 citation statements)
References 45 publications (63 reference statements)
“…Such explanations help to verify the predictions and establish trust in the correct functioning of the system. Layer-wise Relevance Propagation (LRP) [9,58] provides a general framework for explaining individual predictions, i.e., it is applicable to various ML models, including neural networks [9], LSTMs [7], Fisher Vector classifiers [44] and Support Vector Machines [35]. Section 4 gives an overview of recently proposed methods for computing individual explanations.…”
Section: Explaining Individual Predictions
confidence: 99%
“…The propagation process can be theoretically embedded in the deep Taylor decomposition framework [59]. More recently, LRP was extended to a wider set of machine learning models, e.g., in clustering [36] or anomaly detection [35], by first transforming the model into a neural network ('neuralization') and then applying LRP to explain its predictions. Leveraging the model structure, together with appropriate (theoretically motivated) propagation rules, enables LRP to deliver good explanations at very low computational cost (one forward and one backward pass).…”
Section: Propagation-based Approaches (Leveraging Structure)
confidence: 99%
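The "one forward and one backward pass" mechanics of LRP mentioned above can be sketched on a toy network. The weights and the basic LRP-0 (z-rule) used here are illustrative; the cited works employ additional, refined propagation rules:

```python
import numpy as np

# Toy two-layer ReLU network (weights are illustrative, not from the paper)
W1 = np.array([[1.0, -0.5],
               [0.5,  1.0]])
W2 = np.array([[1.0],
               [1.0]])
x  = np.array([1.0, 2.0])

# Forward pass, keeping the activations needed for propagation
a1 = np.maximum(0.0, x @ W1)      # hidden ReLU layer
f  = float(a1 @ W2)               # scalar prediction

def lrp_step(a, W, R):
    """Basic LRP-0 rule: redistribute relevance R from outputs to
    inputs in proportion to their contributions z_jk = a_j * w_jk."""
    z = a[:, None] * W            # input-to-output contributions
    s = z.sum(axis=0) + 1e-12     # per-output normalizer (stabilized)
    return (z / s) @ R            # relevance passed back to the inputs

# Backward relevance pass: relevance starts at the output value
R2 = np.array([f])
R1 = lrp_step(a1, W2, R2)
R0 = lrp_step(x,  W1, R1)         # input-level relevance scores

# Conservation: total relevance is (approximately) preserved per layer
```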
“…(Ribeiro et al, 2016). Recently, some interpretation methods have emerged to understand models beyond classification tasks (Samek et al, 2020; Kauffmann et al, 2020), including the one we present in this paper for the purpose of cluster explanation. ACE's perturbation approach draws inspiration from adversarial machine learning (Xu et al, 2020), where imperceptible perturbations are maliciously crafted to mislead a machine learning model into predicting incorrect outputs.…”
Section: Related Work
confidence: 94%
“…Some approaches have been extended to unsupervised models, e.g. anomaly detection [38], [39] and clustering [40], and attention models have also been developed to explain tasks different from classification such as image captioning [41] or similarity [42]. Our work goes further along this direction and explains similarity built on general neural network models, and by identifying relevant pairs of input features.…”
Section: Related Work
confidence: 99%
“…a_jk(x) = min(a_j(x), τ(a_k(x))). The 'min' operation can be interpreted as a continuous 'AND' [38], and tests at each location for the presence of bigrams jk ∈ 00-99. The function τ represents some translation operation, and we apply several of them to produce candidate alignments between the digits forming the bigrams (e.g.…”
Section: The 'Bigram Network'
confidence: 99%
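As a rough illustration of the min-as-continuous-AND idea in the quote above: the digit activations below are made up, and the translation operator τ is omitted for simplicity.

```python
import numpy as np

# Hypothetical detector activations for digits 0-9 at one image location
a = np.zeros(10)
a[3], a[7] = 0.9, 0.8        # digits '3' and '7' strongly detected

# Bigram response a_jk = min(a_j, a_k): a continuous AND that is large
# only if BOTH digit j and digit k are present
bigram = np.minimum(a[:, None], a[None, :])   # entry (j, k) = bigram 'jk'
np.fill_diagonal(bigram, 0.0)                 # ignore trivial pairs jj

best = np.unravel_index(bigram.argmax(), bigram.shape)
```

Here the strongest bigram response is for the pair of digits that are both active, consistent with the AND-like behavior described in the quote.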