2021
DOI: 10.1109/jstars.2021.3051569
|View full text |Cite
|
Sign up to set email alerts
|

Attention Consistent Network for Remote Sensing Scene Classification

Abstract: Remote sensing (RS) image scene classification is an important research topic in the RS community, which aims to assign the semantics to the land covers. Recently, due to the strong behavior of convolutional neural network (CNN) in feature representation, the growing number of CNN-based classification methods has been proposed for RS images. Although they achieve cracking performance, there is still some room for improvement. First, apart from the global information, the local features are crucial to distingui… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
60
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
9
1

Relationship

0
10

Authors

Journals

citations
Cited by 120 publications
(85 citation statements)
references
References 48 publications
0
60
0
Order By: Relevance
“…The skip-connected covariance (SCCov) network [12] directly uses covariance matrix of different convolution features as the image representation. Other works include PANet50 [19], ResNet-101+EAM [20] and ACNet [21], all focusing on the selfattention-based fusion strategies to enhance feature representations, and have achieved competitive performance. Amongst these methods, SCCov, as a typical second-order pooling method, cannot achieve better performances than other methods.…”
Section: B Experimental Results and Analysismentioning
confidence: 99%
“…The skip-connected covariance (SCCov) network [12] directly uses covariance matrix of different convolution features as the image representation. Other works include PANet50 [19], ResNet-101+EAM [20] and ACNet [21], all focusing on the selfattention-based fusion strategies to enhance feature representations, and have achieved competitive performance. Amongst these methods, SCCov, as a typical second-order pooling method, cannot achieve better performances than other methods.…”
Section: B Experimental Results and Analysismentioning
confidence: 99%
“…At the same time, we input the data set selected by our model into the newly published (Attention Consistent Network for Remote Sensing Scene Classification) [38] The models and algorithms in classification are mainly for high-precision image classification, and because the pixels of the model we used are too low and the similarity is very high, there is over fitting phenomenon when the model runs EuroSAT dataset. [42]and LEVIR (800 × 600) [43] are selected to test the model.…”
Section: B Analysis Of Experimental Resultsmentioning
confidence: 99%
“…Attention is widely used for various tasks, such as machine translation [52], scene classification [53], and semantic segmentation [54]. The early attention mechanism was only designed to learn channel-wise correlations.…”
Section: B Self-attention Mechanismmentioning
confidence: 99%