2018
DOI: 10.1007/978-3-319-99978-4_9
Classification Uncertainty of Deep Neural Networks Based on Gradient Information

Abstract: We study the quantification of uncertainty of Convolutional Neural Networks (CNNs) based on gradient metrics. Unlike the classical softmax entropy, such metrics gather information from all layers of the CNN. We show for the EMNIST digits data set that for several such metrics we achieve the same meta classification accuracy (i.e., the task of classifying predictions as correct or incorrect without knowing the actual label) as for entropy thresholding. We apply meta classification to unknown concepts (out-of-dis…
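The entropy-thresholding baseline mentioned in the abstract (meta-classifying predictions as correct or incorrect from the softmax output alone) can be sketched as follows; the threshold value and function names are illustrative, not taken from the paper:

```python
import numpy as np

def softmax_entropy(probs):
    # Shannon entropy of each predicted class distribution (rows of `probs`).
    return -np.sum(probs * np.log(probs + 1e-12), axis=-1)

def meta_classify(probs, threshold):
    # Meta-classification by thresholding: flag high-entropy predictions
    # as "likely incorrect" without looking at the true labels.
    return softmax_entropy(probs) > threshold

# A confident prediction (low entropy) vs. an uncertain one (high entropy).
probs = np.array([[0.97, 0.01, 0.02],
                  [0.40, 0.35, 0.25]])
flags = meta_classify(probs, threshold=0.5)  # only the second row is flagged
```

In the paper, the entropy threshold itself is what gradient-based metrics are compared against as a meta-classification baseline.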


Cited by 55 publications (42 citation statements)
References 10 publications (10 reference statements)
“…Uncertainty Methods. We evaluate MC-Dropout (DO) [4], MC-DropConnect (DC) [11], Deep Ensembles (DE) [10], Direct Uncertainty Quantification (DUQ) [15], Variational Inference with Flipout (VI) [16], and Gradient-based uncertainty (GD) [12]. This selection covers scalable as well as approximate methods and recent advances.…”
Section: Methods
confidence: 99%
“…Gradient Uncertainty (GD) This method [12] computes the gradient of the loss with respect to trainable parameters, using a virtual label that is the one-hot encoded version of the predicted label, and passes the gradient vector through an aggregation function that produces a scalar, which can be used as an uncertainty measure. This can only be done in a classification setting.…”
Section: Deep Ensembles (DE)
confidence: 99%
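A minimal sketch of the gradient-based score described above, using a single-layer softmax classifier as a stand-in for a full CNN: for cross-entropy loss the gradient with respect to the weights has the closed form (p − y)xᵀ, so no autograd is needed. The L2 norm is just one possible aggregation function, and all names here are illustrative assumptions:

```python
import numpy as np

def softmax(z):
    z = z - z.max()  # shift for numerical stability
    e = np.exp(z)
    return e / e.sum()

def gradient_uncertainty(W, x):
    # Forward pass of a one-layer softmax classifier (stand-in for a CNN).
    p = softmax(W @ x)
    # Virtual label: one-hot encoding of the *predicted* class.
    y = np.zeros_like(p)
    y[np.argmax(p)] = 1.0
    # Closed-form cross-entropy gradient w.r.t. the weights: (p - y) x^T.
    grad = np.outer(p - y, x)
    # Aggregate the gradient tensor to a scalar score (L2 norm here).
    return np.linalg.norm(grad)

# A confident input (peaked softmax) yields a smaller gradient norm than
# an ambiguous input (near-uniform softmax).
W = 5.0 * np.eye(3)
low = gradient_uncertainty(W, np.array([1.0, 0.0, 0.0]))
high = gradient_uncertainty(W, np.array([0.34, 0.33, 0.33]))
```

The intuition: if the prediction is confident, p is already close to the virtual one-hot label, so the loss gradient is small; ambiguous predictions produce large gradients and hence high uncertainty scores.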
“…
| Classification Type | Reference | Contribution |
| --- | --- | --- |
| Supervised | [19] | An uncertainty measure based on the gradient of the negative log-likelihood is used as a measure of confidence |
| Supervised | [20] | Confidence scores based on Mahalanobis distances from different layers are combined using weighted averaging |
| Supervised | [21] | Invariance of the classifier's softmax under various transformations of the input image is used as a measure of confidence |
| Supervised | [22] | The ratio of Hausdorff distances from a test sample to the nearest non-predicted and the predicted classes is used as the trust score |
| Semi-supervised | [23] | A likelihood-ratio-based method is used to differentiate between in-distribution and OOD examples |
| Semi-supervised | [24] | A two-head CNN consisting of a common feature extractor and two classifiers with different decision boundaries is trained to detect OOD examples |
| Unsupervised | [25] | The predicted softmax probability is used to detect OOD examples |
| Unsupervised | [26] | Temperature scaling and small perturbations added to the input are used to better separate softmax scores for OOD detection |
| Unsupervised | [27] | A GAN-based architecture is used to compare the bottleneck features of the generated image with those of the test image |
| Unsupervised | [28] | A degenerated prior network with a concentration perturbation algorithm is used to obtain a better uncertainty measure |
| Unsupervised | [29] | Learning to discriminate between geometric transformations is used to learn unique features useful for OOD detection |
| Unsupervised | [30] | The Mahalanobis distance is applied in the latent space of an autoencoder to detect OOD examples |
| Unsupervised | [31] | A resampling uncertainty estimation approach is proposed as an approximation to the bootstrap |
…”
Section: Classification Type, Reference, and Contributions
confidence: 99%
“…In [19], an approach to measuring the uncertainty of a neural network based on gradient information of the negative log-likelihood at the predicted class label is presented. In this method, gradient metrics are computed from all layers and scalarized using norm or min/max operations.…”
Section: A. Supervised Approaches
confidence: 99%
“…Rather than relying on the maximum softmax score, some researchers have defined score functions based on different distance measures. For example, the Mahalanobis distance was computed and calibrated on the intermediate features of DNNs to serve as the confidence score [9,10]; a measure of confidence was proposed by analyzing the invariance of the softmax score under various transformations of the input [11]; the uncertainty of DNNs was evaluated by using gradient information from all layers as the score function [12]; and the trust score for each input was defined as the ratio of the Hausdorff distances from the input to its closest and second-closest classes, which determines whether a classifier's prediction can be trusted [13]. By projecting the inputs into a new space, these newly defined score functions can distinguish InD and OOD samples better than methods relying on the softmax score.…”
Section: OOD Detection Without Tuning the Pre-trained Classifier
confidence: 99%
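As an illustration of the Mahalanobis-style score functions cited above [9,10], here is a minimal sketch that fits class-conditional Gaussians with a tied (shared) covariance on feature vectors; the function names and the ridge constant are my own assumptions, not the cited papers' API:

```python
import numpy as np

def fit_class_stats(features, labels):
    # Per-class means and a single tied covariance over all classes.
    classes = np.unique(labels)
    means = {c: features[labels == c].mean(axis=0) for c in classes}
    centered = np.vstack([features[labels == c] - means[c] for c in classes])
    cov = centered.T @ centered / len(features)
    # Small ridge keeps the covariance invertible.
    precision = np.linalg.inv(cov + 1e-6 * np.eye(features.shape[1]))
    return means, precision

def mahalanobis_score(x, means, precision):
    # Confidence = negative squared Mahalanobis distance to the closest
    # class mean; inputs far from every class (OOD) get very negative scores.
    dists = [(x - m) @ precision @ (x - m) for m in means.values()]
    return -min(dists)

# Synthetic two-class features: an in-distribution point should score
# higher than a far-away OOD point.
rng = np.random.default_rng(0)
features = np.vstack([rng.normal(0.0, 1.0, (100, 2)),
                      rng.normal(5.0, 1.0, (100, 2))])
labels = np.array([0] * 100 + [1] * 100)
means, precision = fit_class_stats(features, labels)
ind_score = mahalanobis_score(np.array([0.1, -0.2]), means, precision)
ood_score = mahalanobis_score(np.array([50.0, -50.0]), means, precision)
```

In the cited works, the features come from intermediate DNN layers and the per-layer scores are further combined and calibrated; this sketch shows only the core distance computation.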