2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW)
DOI: 10.1109/iccvw.2019.00242

Understanding the Effects of Pre-Training for Object Detectors via Eigenspectrum

Abstract: ImageNet pre-training has long been regarded as essential for training accurate object detectors. Recently, it has been shown that object detectors trained from randomly initialized weights can be on par with those fine-tuned from ImageNet pre-trained models. However, the effects of pre-training, and the differences it causes, are still not fully understood. In this paper, we analyze the eigenspectrum dynamics of the covariance matrix of each feature map in object detectors. Based on our an…
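The analysis the abstract describes can be sketched in a few lines. The snippet below is a minimal illustration, not the paper's exact estimator: it treats every spatial position of an (N, C, H, W) feature map as one C-dimensional sample, forms the channel covariance matrix, and returns its eigenvalues in descending order. The function name and the sampling choice are assumptions.

```python
import torch

def feature_map_eigenspectrum(feat: torch.Tensor) -> torch.Tensor:
    """Eigenspectrum of the channel covariance of a conv feature map.

    feat: activations of shape (N, C, H, W). Each spatial position of
    each image is treated as one C-dimensional sample; the paper's
    exact estimator may differ.
    """
    n, c, h, w = feat.shape
    x = feat.permute(0, 2, 3, 1).reshape(-1, c)  # (N*H*W, C) samples
    x = x - x.mean(dim=0, keepdim=True)          # center each channel
    cov = x.T @ x / (x.shape[0] - 1)             # (C, C) covariance
    eig = torch.linalg.eigvalsh(cov)             # ascending eigenvalues
    return eig.flip(0)                           # largest first

# Example on random activations:
# spectrum = feature_map_eigenspectrum(torch.randn(8, 256, 32, 32))
```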

Cited by 10 publications (7 citation statements)
References 69 publications

“…For example, in the right part of Figure 5, 500 fine-tuning iterations from DAP already achieve ≥ 65 AP.5, while the corresponding CLS numbers are lower than 20. This demonstrates that a better pre-trained model can provide faster convergence speed, which is consistent with [25,20,16,34].…”
Section: Discussion (supporting)
confidence: 73%
“…Classification pre-training may sometimes even harm localization when the downstream data is abundant, while still benefiting classification [25]. Shinya et al. try to understand the impact of ImageNet classification pre-training on detection and discover that the pre-trained model generates a narrower eigenspectrum than the from-scratch model [34]. Recent work proposes a cheaper Montage pre-training for detection on the target detection data and obtains on-par or better performance than ImageNet classification pre-training [48].…”
Section: Related Work (mentioning)
confidence: 99%
“…In particular, the number of singular values larger than a given (plausible) threshold changes consistently. A similar observation was also made in [53] in the context of object detectors (but based on the normalized eigenspectrum of the Hessian). This observation may explain the different dynamics of the optimization scheme for the pretrained model and the model trained from scratch.…”
Section: Spectral Evaluation of Pretraining (supporting)
confidence: 67%
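The threshold count described in this statement is straightforward to reproduce. Below is a hedged sketch: the threshold `tau` (a fraction of the largest singular value) and the choice to flatten conv kernels into 2D matrices are assumptions, since the quoted works pick their own "plausible" thresholds.

```python
import torch

def count_singular_values(weight: torch.Tensor, tau: float = 1e-2) -> int:
    """Number of singular values exceeding tau * (largest singular value)."""
    s = torch.linalg.svdvals(weight)  # singular values, descending
    return int((s > tau * s[0]).sum())

# Example on a conv kernel flattened to (out_channels, in_channels*k*k):
w = torch.randn(256, 128, 3, 3)
print(count_singular_values(w.reshape(w.shape[0], -1)))
```

Comparing this count before and after pre-training (or between a pre-trained and a from-scratch model) is one concrete way to quantify how "wide" or "narrow" a spectrum is.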
“…Epochs for the first learning rate decay, the second decay, and the end of training are (8, 11, 12) for the 1× schedule, (16, 22, 24) for the 2× schedule, and (16, 19, 20) for the 20e schedule. To avoid the overfitting caused by small learning rates [52], the 20e schedule is reasonable.…”
Section: D.1 Common Settings (mentioning)
confidence: 99%
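The three-number schedules quoted above map directly onto a step learning-rate scheduler. Here is a minimal PyTorch sketch of the 20e schedule; the placeholder model, optimizer, and learning rate are assumptions for illustration only.

```python
import torch
from torch.optim.lr_scheduler import MultiStepLR

model = torch.nn.Conv2d(3, 64, 3)  # placeholder network
optimizer = torch.optim.SGD(model.parameters(), lr=0.02, momentum=0.9)
# 20e schedule: decay the learning rate at epochs 16 and 19, stop at 20.
scheduler = MultiStepLR(optimizer, milestones=[16, 19], gamma=0.1)

for epoch in range(20):
    # ... run one training epoch here ...
    scheduler.step()
```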