2019
DOI: 10.3390/app10010083
|View full text |Cite
|
Sign up to set email alerts
|

Object Detection with Low Capacity GPU Systems Using Improved Faster R-CNN

Abstract: Object detection in remote sensing images has been frequently used in a wide range of areas such as land planning, city monitoring, traffic monitoring, and agricultural applications. It is essential in the field of aerial and satellite image analysis but it is also a challenge. To overcome this challenging problem, there are many object detection models using convolutional neural networks (CNN). The deformable convolutional structure has been introduced to eliminate the disadvantage of the fixed grid structure… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
10
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 16 publications
(11 citation statements)
references
References 13 publications
0
10
0
Order By: Relevance
“…The results given in Table 3 reveal that the proposed model provides better results than those in other studies. Although our method ensures high performance, we observed a discrepancy in the detection accuracy for the same category of object (e.g., storage tank) as in the DODN [12] and Improved Faster R-CNN [13] models. This is mainly because of the imbalance among the classes in the NWPU-VHR10 database.…”
mentioning
confidence: 78%
See 2 more Smart Citations
“…The results given in Table 3 reveal that the proposed model provides better results than those in other studies. Although our method ensures high performance, we observed a discrepancy in the detection accuracy for the same category of object (e.g., storage tank) as in the DODN [12] and Improved Faster R-CNN [13] models. This is mainly because of the imbalance among the classes in the NWPU-VHR10 database.…”
mentioning
confidence: 78%
“…The present model is based on the latest Faster R-CNN (Figure 3) [26], a state-of-the-art object detection system. The deformable convolution [27] technique is used to overcome the weaknesses of the regular convolution structure used in the Faster R-CNN model for detecting small and mixed objects in remote sensing images [13]. With the Feature Pyramid Network (FPN) [28] technique, the high-resolution features in the shallow layers of the remote sensing images are transferred to the network.…”
Section: Improved Faster R-cnnmentioning
confidence: 99%
See 1 more Smart Citation
“…In Equations (4) and (5), D represents the number of layers of the CNN, i.e., the depth of the network; l represents the lth convolution layer of the CNN; M l represents the side length of the output feature map for the lth convolution layer; K represents the side length of each convolution kernel; C l−1 represents the number of input channels of the lth convolution layer, i.e., the number of output channels of the (l − 1)th convolution layer; and C l represents the number of output channels of the lth convolution layer, i.e., the number of convolution kernels of this layer.…”
Section: Methodsmentioning
confidence: 99%
“…Feature extraction and classification of original images can be conducted via multi-layer convolution operations, and the position of an object in an image can be predicted using boundary boxes, providing the capability of visual understanding. The results of these studies can be widely applied in facial recognition [1], attitude prediction [2], video surveillance, and a variety of other intelligent applications [3][4][5].…”
Section: Introductionmentioning
confidence: 99%