2021
DOI: 10.1109/tgrs.2020.2991407
|View full text |Cite
|
Sign up to set email alerts
|

Automatic Weakly Supervised Object Detection From High Spatial Resolution Remote Sensing Images via Dynamic Curriculum Learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
57
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 129 publications
(57 citation statements)
references
References 35 publications
0
57
0
Order By: Relevance
“…Cheng et al [31] proposed a unified framework to generate and select high-quality proposals, which combines selective search [32] and a Gradient-weighted Class Activation Mapping [33] to generate more proposals with higher quality, and then chooses many confident positive proposals and only class-specific hard negatives to train more effective by upweighting the losses of discriminative negative proposals. To provide high-quality initial samples and obtain optimal object detectors with only image-level annotations, Yao et al [34] proposed a dynamic curriculum learning strategy with an entropy-based criterion and designed an effective instanceaware focal loss function, which can progressively learn the object detectors by feeding training images with increasing difficulty that matches current detection ability. To avoid selecting only one top-scoring proposal that usually results in learning a suboptimal object detector, Feng et al [35]proposed a novel end-to-end progressive contextual instance refinement method by leveraging both local and global context information for weakly supervised object detection.…”
Section: Semi-supervised Semantic Segmentationmentioning
confidence: 99%
“…Cheng et al [31] proposed a unified framework to generate and select high-quality proposals, which combines selective search [32] and a Gradient-weighted Class Activation Mapping [33] to generate more proposals with higher quality, and then chooses many confident positive proposals and only class-specific hard negatives to train more effective by upweighting the losses of discriminative negative proposals. To provide high-quality initial samples and obtain optimal object detectors with only image-level annotations, Yao et al [34] proposed a dynamic curriculum learning strategy with an entropy-based criterion and designed an effective instanceaware focal loss function, which can progressively learn the object detectors by feeding training images with increasing difficulty that matches current detection ability. To avoid selecting only one top-scoring proposal that usually results in learning a suboptimal object detector, Feng et al [35]proposed a novel end-to-end progressive contextual instance refinement method by leveraging both local and global context information for weakly supervised object detection.…”
Section: Semi-supervised Semantic Segmentationmentioning
confidence: 99%
“…The latest works on WSL based object detection from RSIs are [46,47]. In [46], Yao et al propose a dynamic curriculum learning strategy to perform weakly supervised object detection from high-resolution RSIs.…”
Section: B Weakly Supervised Learningmentioning
confidence: 99%
“…This work can progressively learn the object detectors by feeding training images with increasing difficulty that matches current detection ability. The work [47] designs a dual-contextual instance refinement strategy to divert the focus of detection network from local distinct part to the object and further to other potential instances by leveraging both local and global context information. With this, it can significantly boost object detection accuracy compared with the state of the arts.…”
Section: B Weakly Supervised Learningmentioning
confidence: 99%
“…For example, several weakly supervised object detection (WSOD) methods for remote sensing images have been proposed, e.g. the progressive contextual instance refinement method [33], and the dynamic curriculum learning method [34]. As to image-level WSSS for remote sensing images, it needs more spatial prior information for training a segmentation model compared with WSOD, and several pioneering works have also been proposed [20].…”
Section: Introductionmentioning
confidence: 99%