2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019
DOI: 10.1109/cvpr.2019.00333
|View full text |Cite
|
Sign up to set email alerts
|

Spectral Metric for Dataset Complexity Assessment

Abstract: In this paper, we propose a new measure to gauge the complexity of image classification problems. Given an annotated image dataset, our method computes a complexity measure called the cumulative spectral gradient (CSG) which strongly correlates with the test accuracy of convolutional neural networks (CNN). The CSG measure is derived from the probabilistic divergence between classes in a spectral clustering framework. We show that this metric correlates with the overall separability of the dataset and thus its … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
16
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
3
3
3

Relationship

0
9

Authors

Journals

citations
Cited by 22 publications
(20 citation statements)
references
References 38 publications
(58 reference statements)
0
16
0
Order By: Relevance
“…For comparison with other quality measures, we chose F1, N1, and N3 from Ho and Basu [1] and CSG from Branchaud-Charron et al [3]. Here, N1, N3, and CSG are known to be highly correlated with test accuracy of classifiers Branchaud-Charron et al [3]. F1 is similar to our M sep in its basic idea.…”
Section: Experiments Resultsmentioning
confidence: 99%
“…For comparison with other quality measures, we chose F1, N1, and N3 from Ho and Basu [1] and CSG from Branchaud-Charron et al [3]. Here, N1, N3, and CSG are known to be highly correlated with test accuracy of classifiers Branchaud-Charron et al [3]. F1 is similar to our M sep in its basic idea.…”
Section: Experiments Resultsmentioning
confidence: 99%
“…DA is a tradeoff between noise vs. knowledge injection, so it could be a nice theoretic direction to think about DA under statistical query model (Kearns, 1998) with translation between formal languages (ws-, 2019). This could inspire another essential question: what is the intrinsic properties of the augmented data (Branchaud-Charron et al, 2019) that matter in discrete domain. Applications like active data selection (Coleman et al, 2019) guided with margin or sensitivity can be derived.…”
Section: Discussionmentioning
confidence: 99%
“…More recent work for characterisation of large image datasets is shown by Branchaud-Charron et al [2019].…”
Section: Related Workmentioning
confidence: 99%