2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022
DOI: 10.1109/cvpr52688.2022.00037
|View full text |Cite
|
Sign up to set email alerts
|

How Much More Data Do I Need? Estimating Requirements for Downstream Tasks

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
23
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(28 citation statements)
references
References 15 publications
1
23
0
Order By: Relevance
“…Multi-variate scaling laws have also been considered for some specific tasks, for example in transfer learning from synthetic to real data sets [11]. Finally, Mahmood et al [2] explore data collection by estimating the minimum amount of data needed to meet a given target performance over multiple rounds. Our paper extends these prior studies by developing an optimization problem to minimize the expected total cost of data collected.…”
Section: Related Workmentioning
confidence: 99%
See 4 more Smart Citations
“…Multi-variate scaling laws have also been considered for some specific tasks, for example in transfer learning from synthetic to real data sets [11]. Finally, Mahmood et al [2] explore data collection by estimating the minimum amount of data needed to meet a given target performance over multiple rounds. Our paper extends these prior studies by developing an optimization problem to minimize the expected total cost of data collected.…”
Section: Related Workmentioning
confidence: 99%
“…For instance, Rosenfeld et al [6] fit power law functions on the performance statistics of small data sets to extrapolate the learning curve with more data. In contrast, Mahmood et al [2] consider estimating data requirements and show that even small errors in a power law model of the learning curve can translate to massively over-or underestimating how much data is needed. Beyond this, different data sources have different costs and scale differently with performance [11][12][13][14].…”
Section: Introductionmentioning
confidence: 99%
See 3 more Smart Citations