2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition 2018
DOI: 10.1109/cvpr.2018.00610
|View full text |Cite
|
Sign up to set email alerts
|

Low-Shot Learning with Imprinted Weights

Abstract: Human vision is able to immediately recognize novel visual categories after seeing just one or a few training examples. We describe how to add a similar capability to ConvNet classifiers by directly setting the final layer weights from novel training examples during low-shot learning. We call this process weight imprinting as it directly sets weights for a new category based on an appropriately scaled copy of the embedding layer activations for that training example. The imprinting process provides a valuable … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

3
439
1

Year Published

2019
2019
2021
2021

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 500 publications
(443 citation statements)
references
References 16 publications
3
439
1
Order By: Relevance
“…Unfortunately, the preliminary results were disappointing, as we saw no significant improvements in accuracy when compared to using only the directly synthesized speech data. This parallels the findings of [10] where their augmentation of the small amount of data available for a new class failed to improve its classifier performance. We also suspect this may be due to the embedding model already having learned to deal with these distortions.…”
Section: Synthesized Speech Datasupporting
confidence: 76%
“…Unfortunately, the preliminary results were disappointing, as we saw no significant improvements in accuracy when compared to using only the directly synthesized speech data. This parallels the findings of [10] where their augmentation of the small amount of data available for a new class failed to improve its classifier performance. We also suspect this may be due to the embedding model already having learned to deal with these distortions.…”
Section: Synthesized Speech Datasupporting
confidence: 76%
“…We take pre-trained ResNet18 [5] with ImageNet as the feature extractor f ϕ . Train/test split setting is followed the suggestion of Imprinted Weights [47]. Here, 100 novel classes are required to be 33 distinguished, which is very challenging and similar to the real-world scenario.…”
Section: Caltech-ucsd Birdsmentioning
confidence: 99%
“…An interesting work regarding CNN classifiers using low-shot learning is given in [28]. The idea is to enable a model to successfully classify a newly seen category after being presented with merely few training examples.…”
Section: Future Workmentioning
confidence: 99%
“…Combining this work and DeepMimic might be very interesting, in the following sense. While using a mentor model trained on specific categories, upon the arrival of a novel category it might be easier to implant the new category in a student model combining the two processes described in DeepMimic and [28]. It is possible that a student model would adjust more naturally to new categories during the training process itself rather than an already trained model.…”
Section: Future Workmentioning
confidence: 99%