2021
DOI: 10.48550/arxiv.2110.03095
Preprint

Which Shortcut Cues Will DNNs Choose? A Study from the Parameter-Space Perspective

Abstract: Deep neural networks (DNNs) often rely on easy-to-learn discriminatory features, or cues, that are not necessarily essential to the problem at hand. For example, ducks in an image may be recognized based on their typical background scenery, such as lakes or streams. This phenomenon, also known as shortcut learning, is emerging as a key limitation of the current generation of machine learning models. In this work, we introduce a set of experiments to deepen our understanding of shortcut learning and its implica…
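The duck-and-lake example can be made concrete with a toy experiment (a minimal stdlib-Python sketch, not from the paper; all names and parameters are illustrative): a linear classifier is trained on data where an easy, near-perfectly predictive "spurious" feature coexists with a noisier "core" feature. The model latches onto the shortcut, and its accuracy collapses when the spurious feature is decorrelated at test time.

```python
import math
import random

random.seed(0)

def make_data(n, shortcut_intact=True):
    """y is the label; 'core' is weakly predictive, 'spur' is an easy shortcut."""
    data = []
    for _ in range(n):
        y = random.randint(0, 1)
        core = (y - 0.5) + random.gauss(0, 1.0)      # hard, noisy core feature
        if shortcut_intact:
            spur = (y - 0.5) + random.gauss(0, 0.1)  # easy, almost perfectly predictive
        else:
            spur = random.gauss(0, 0.5)              # decorrelated at test time
        data.append(((core, spur), y))
    return data

def train_logreg(data, lr=0.5, steps=500):
    """Plain batch gradient descent on the logistic loss."""
    w = [0.0, 0.0]
    b = 0.0
    n = len(data)
    for _ in range(steps):
        gw = [0.0, 0.0]
        gb = 0.0
        for (x, y) in data:
            z = w[0] * x[0] + w[1] * x[1] + b
            p = 1.0 / (1.0 + math.exp(-z))
            d = p - y
            gw[0] += d * x[0]
            gw[1] += d * x[1]
            gb += d
        w[0] -= lr * gw[0] / n
        w[1] -= lr * gw[1] / n
        b -= lr * gb / n
    return w, b

def accuracy(w, b, data):
    return sum(
        int((w[0] * x[0] + w[1] * x[1] + b > 0) == (y == 1)) for (x, y) in data
    ) / len(data)

train = make_data(1000, shortcut_intact=True)
test = make_data(1000, shortcut_intact=False)
w, b = train_logreg(train)
print(abs(w[1]) > abs(w[0]))            # the shortcut feature dominates the weights
print(round(accuracy(w, b, train), 2))  # high in-distribution accuracy
print(round(accuracy(w, b, test), 2))   # drops sharply once the shortcut is removed
```

The gap between the two accuracies is the signature of shortcut learning: the shortcut feature, being the simpler cue, captures most of the weight even though the core feature remains predictive under the shift.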

Cited by 4 publications (5 citation statements) · References 20 publications
“…Shah et al. [87] show empirically that in certain scenarios neural networks can suffer from extreme simplicity bias and rely on simple spurious features while ignoring the core features; in Section 4.2 we revisit these problems and provide further discussion. Hermann and Lampinen [38] and Jacobsen et al. [45] also show synthetic and natural examples where neural networks ignore relevant features, and Scimeca et al. [86] explore which types of shortcuts are more likely to be learned. Kolesnikov and Lampert [50], on the other hand, show that on realistic datasets, core and spurious features can often be distinguished from the latent representations learned by a neural network in the context of object localization.…”
Section: Related Work
confidence: 99%
“…While the distribution shift explains part of the story, we emphasize that what is equally important for shortcut learning is the difficulty of the spurious features themselves (see Fig. 1). Previous works such as Shah et al. (2020) and Scimeca et al. (2021) hint at this, but we take this line of thought further by viewing shortcut learning as a phenomenon that impacts dataset difficulty, which can be captured by monitoring early training dynamics.…”
Section: Introduction
confidence: 75%
“…Rather, only those spurious features that are easier than the core features are potential shortcuts (see Fig. 1). Previous works such as Shah et al. (2020) and Scimeca et al. (2021) hint at this by showing that DNNs are biased towards simple solutions, and Dagaev et al. (2021) use the "too-good-to-be-true" prior to emphasize that simple solutions are unlikely to be valid across contexts. Veitch et al. (2021) distinguish various model features using tools from causality and stress-test the models for counterfactual invariance.…”
Section: Related Work
confidence: 99%
“…Biases in machine learning. Emerging studies have revealed that DNNs rely on shortcut biases [4,10,21,22,44]. Existing de-biasing methods implicitly make a model attend less to dataset biases, using extra biased networks [4,10] or data augmentations [22] without requiring bias labels.…”
Section: Related Work
confidence: 99%