2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019
DOI: 10.1109/cvpr.2019.00672
|View full text |Cite
|
Sign up to set email alerts
|

Few-Shot Learning With Localization in Realistic Settings

Abstract: Traditional recognition methods typically require large, artificially-balanced training classes, while few-shot learning methods are tested on artificially small ones. In contrast to both extremes, real world recognition problems exhibit heavy-tailed class distributions, with cluttered scenes and a mix of coarse and fine-grained class distinctions. We show that prior methods designed for few-shot learning do not work out of the box in these challenging conditions, based on a new "meta-iNat" benchmark. We intro… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
86
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 123 publications
(90 citation statements)
references
References 29 publications
0
86
0
Order By: Relevance
“…It shows that when learning non-linear metric methods, with other network added, the training parameters in the model are increasing, it is easy for the network to overfit so as to cause a bad result. Besides, we compare our result with 17 state-of-the-art meta-metric learning algorithms, which are D-SVS [33], SN [34], SRPN [35], PML [29], DN4 [29], PCP [21], LCC [36], L2AE-D [37], IMP [23], PN [38], METRIC1 [39], DC [17], CovaMNET [40], AM3 [22], VFL [41], SHS [42], PARN [43]. As is shown in FIGURE 4, our results by combining Matching Network with fine-tuned ResNet50 outperform all of the state-of-the-art results, which illustrates the importance of transfer learning.…”
Section: )mentioning
confidence: 99%
“…It shows that when learning non-linear metric methods, with other network added, the training parameters in the model are increasing, it is easy for the network to overfit so as to cause a bad result. Besides, we compare our result with 17 state-of-the-art meta-metric learning algorithms, which are D-SVS [33], SN [34], SRPN [35], PML [29], DN4 [29], PCP [21], LCC [36], L2AE-D [37], IMP [23], PN [38], METRIC1 [39], DC [17], CovaMNET [40], AM3 [22], VFL [41], SHS [42], PARN [43]. As is shown in FIGURE 4, our results by combining Matching Network with fine-tuned ResNet50 outperform all of the state-of-the-art results, which illustrates the importance of transfer learning.…”
Section: )mentioning
confidence: 99%
“…Fine-grained-based models. Now that few-shot learning in fine-grained scenarios has been studied, we chose some new work published as baselines-few-shot fine-grained image recognition (FSFG; Wei et al, 2019) and Meta-iNat (Wertheimer & Hariharan, 2019) and some other classical models that are validated on fine-grained data sets.…”
Section: Baselinesmentioning
confidence: 99%
“…Although there are some classic studies of few-shot learning in finegrained settings Wei, Wang, Liu, Shen, and Wu, 2019;Wertheimer & Hariharan, 2019), two problems remain to be solved in the existing methods. One problem is that a limited number of images per class can not represent the class distribution effectively, and image-level features are difficult to reflect the subtle visual differences between fine-grained images.…”
Section: Introductionmentioning
confidence: 99%
“…The main ideas of the above studies can be summarized as designing the neural network structure, obtaining a good initialization, guiding the search steps or augmenting the dataset. In contrast to those studies, [14] and [27] focus on the different roles played by backgrounds and foregrounds in the classification task. [14] attempts to improve the performance of the model by enhancing main features with foregrounds.…”
Section: Introductionmentioning
confidence: 99%
“…In contrast to those studies, [14] and [27] focus on the different roles played by backgrounds and foregrounds in the classification task. [14] attempts to improve the performance of the model by enhancing main features with foregrounds. For this purpose, an additional network is introduced to locate the foregrounds, which brings the increase of the network complexity and the requirement of the dataset with bounding box.…”
Section: Introductionmentioning
confidence: 99%