2019
DOI: 10.3390/rs11171996
|View full text |Cite|
|
Sign up to set email alerts
|

RETRACTED: Attention-Based Deep Feature Fusion for the Scene Classification of High-Resolution Remote Sensing Images

Abstract: Scene classification of highresolution remote sensing images (HRRSI) is one of the most important means of landcover classification. Deep learning techniques, especially the convolutional neural network (CNN) have been widely applied to the scene classification of HRRSI due to the advancement of graphic processing units (GPU). However, they tend to extract features from the whole images rather than discriminative regions. The visual attention mechanism can force the CNN to focus on discriminative regions, but … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
29
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 36 publications
(29 citation statements)
references
References 64 publications
0
29
0
Order By: Relevance
“…CNN is widely used in remote sensing scene classification [21][22][23] because of its powerful feature extraction capability in various fields such as object tracking [24][25][26], detection [27], and classification [28]. Compared to fine-tuning existing models [29], many researchers focus on the design of the network model [2,13,30], aiming to obtain a higher classification accuracy via additional modules. For example [13], in addition to using Resnet [31] to extract feature maps from the image, Zhang et al [13] designed a convolutional network to extract feature maps from attention maps [32].…”
Section: Motivationmentioning
confidence: 99%
See 3 more Smart Citations
“…CNN is widely used in remote sensing scene classification [21][22][23] because of its powerful feature extraction capability in various fields such as object tracking [24][25][26], detection [27], and classification [28]. Compared to fine-tuning existing models [29], many researchers focus on the design of the network model [2,13,30], aiming to obtain a higher classification accuracy via additional modules. For example [13], in addition to using Resnet [31] to extract feature maps from the image, Zhang et al [13] designed a convolutional network to extract feature maps from attention maps [32].…”
Section: Motivationmentioning
confidence: 99%
“…Compared to fine-tuning existing models [29], many researchers focus on the design of the network model [2,13,30], aiming to obtain a higher classification accuracy via additional modules. For example [13], in addition to using Resnet [31] to extract feature maps from the image, Zhang et al [13] designed a convolutional network to extract feature maps from attention maps [32]. These feature maps will be fused with the feature maps extracted by Resnet for classification.…”
Section: Motivationmentioning
confidence: 99%
See 2 more Smart Citations
“…Compared with traditional feature acquisition methods, the pre-trained ConvNets can extract more representative features. Therefore, the authors extracted the fully connected layer features of remote sensing images from pre-trained ConvNets to classify scenes in [36][37][38]. The literature [39][40][41] used the traditional feature coding methods [42,43] to encode and classify features of convolutional layers.…”
mentioning
confidence: 99%