2017 IEEE International Conference on Computer Vision (ICCV) 2017
DOI: 10.1109/iccv.2017.97
|View full text |Cite
|
Sign up to set email alerts
|

Revisiting Unreasonable Effectiveness of Data in Deep Learning Era

Abstract: The success of deep learning in vision can be attributed to: (a) models with high capacity; (b) increased computational power; and (c) availability of large-scale labeled data. Since 2012, there have been significant advances in representation capabilities of the models and computational capabilities of GPUs. But the size of the biggest dataset has surprisingly remained constant. What will happen if we increase the dataset size by 10× or 100×? This paper takes a step towards clearing the clouds of mystery surr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

33
1,153
2
5

Year Published

2017
2017
2023
2023

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 1,775 publications
(1,193 citation statements)
references
References 49 publications
33
1,153
2
5
Order By: Relevance
“…Machine learning tools and improved ability to gather data provides the opportunity to learn more sophisticated WOC methods in a datadriven fashion (Rokach, 2010;Bachrach et al, 2012;Polikar, 2012;Sun et al, 2017). We are likely to learn much more about effective strategies of opinion aggregation through their widespread adoption.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Machine learning tools and improved ability to gather data provides the opportunity to learn more sophisticated WOC methods in a datadriven fashion (Rokach, 2010;Bachrach et al, 2012;Polikar, 2012;Sun et al, 2017). We are likely to learn much more about effective strategies of opinion aggregation through their widespread adoption.…”
Section: Discussionmentioning
confidence: 99%
“…Recent technological advances have also opened up the possibility of gathering very large datasets from which collective wisdom can be extracted (Sun et al, 2017). Large datasets allow researchers to consider and reliably test increasingly complex methodologies of opinion aggregation.…”
Section: Introductionmentioning
confidence: 99%
“…tissue image in this application) are very unique compared to those of the training data, then the trained machine learning models will not recognize the input. As suggested by a recent work on the relationship between deep learning and training data [50], this problem can be solved by using more training data and fine tuning the neural network in our future work. In case of having extremely unique samples, a rejection option [51, 52] can be incorporated to the machine learning models, and the enhanced models can refuse to make a decision if the patterns of input are very different from those of the training data.…”
Section: Discussionmentioning
confidence: 99%
“…(Pratt, 2017) Simultaneously, research on technology improvement across industries suggests that there is a power-law relationship between production and performance: A doubling of production leads to a constant improvement in performance (as measured by cost or other characteristics) (Nagy et al, 2013). A similar relationship may exist for the performance of machine learning algorithms and data sets (Sun et al, 2017). For HAVs, this suggests that achieving gains that some might consider "near perfect" may take much more effort and time than reaching better-than-average human performance, which may itself be still out of reach.…”
Section: What Does the Evidence Suggest About The Conditions That Leamentioning
confidence: 99%