2020
DOI: 10.1007/978-3-030-58598-3_8
|View full text |Cite
|
Sign up to set email alerts
|

Enabling Deep Residual Networks for Weakly Supervised Object Detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
13
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 34 publications
(14 citation statements)
references
References 69 publications
1
13
0
Order By: Relevance
“…We use ResNet50 as the backbone by default and also report the results using VGG16. We observe that the results using VGG16 are only slightly worse than those using ResNet50, which is consistent with the observation in [33]. One possible explanation is that the MIL classifier may back-propagate uncertain and inaccurate gradients to backbones while skip connection in ResNet50 can not alleviate this issue.…”
Section: Experiments On Coco-60supporting
confidence: 87%
“…We use ResNet50 as the backbone by default and also report the results using VGG16. We observe that the results using VGG16 are only slightly worse than those using ResNet50, which is consistent with the observation in [33]. One possible explanation is that the MIL classifier may back-propagate uncertain and inaccurate gradients to backbones while skip connection in ResNet50 can not alleviate this issue.…”
Section: Experiments On Coco-60supporting
confidence: 87%
“…Training the semi-autonomous learning network with ResNet backbone will reduce the identification of the proposed features, and will be weak in localizing object instances. Discovered by [36], the proposed semi-autonomous learning algorithm in this paper takes MRN as the backbone network.…”
Section: Modified Residual Network For Semi-autonomous Learningmentioning
confidence: 99%
“…This is because non-maximum down-sampling may not retain the activation and gradient of the information flowing through the network under weak supervision. Inspired by [36], a MRN is proposed in this paper, and the small kernel convolution and max-pooling are used to improve the robustness of information flow, which makes the object boundary more detailed. Specifically, the original stem block is replaced by three conservative 3 × 3 convolutions, and the first and third convolutions are followed by the max-pooling layer.…”
Section: mentioning
confidence: 99%
See 1 more Smart Citation
“…In this case, earlier detection algorithms will be more complicated in design, and the detection effect cannot meet the actual demand. After more than a decade of development, deep neural networks have gradually matured, and many high-level network design solutions have emerged, becoming the mainstream algorithm for solving object detection problems [10][11][12][13][14][15][16][17][18][19][20]. Among these methods, Faster-RCNN [10] provides a new idea to accomplish the task of multi-category target detection for images on an efficient and high accuracy basis.…”
Section: Introductionmentioning
confidence: 99%