Attention Consistent Network for Remote Sensing Scene Classification

Tang, Xu; Ma, Qiushuo; Zhang, Xiangrong; Liu, Fang; Ma, Jingjing; Jiao, Licheng

doi:10.1109/jstars.2021.3051569

Cited by 120 publications

(85 citation statements)

References 48 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The skip-connected covariance (SCCov) network [12] directly uses covariance matrix of different convolution features as the image representation. Other works include PANet50 [19], ResNet-101+EAM [20] and ACNet [21], all focusing on the selfattention-based fusion strategies to enhance feature representations, and have achieved competitive performance. Amongst these methods, SCCov, as a typical second-order pooling method, cannot achieve better performances than other methods.…”

Section: B Experimental Results and Analysismentioning

confidence: 99%

First and Second-Order Information Fusion Networks for Remote Sensing Scene Classification

Samat

Zhang

et al. 2022

IEEE Geosci. Remote Sensing Lett.

View full text Add to dashboard Cite

Deep convolutional networks have been the most competitive method in remote sensing scene classification. Due to the diversity and complexity of scene content, remote sensing scene classification still remains a challenging task. Recently, the secondorder pooling method has attracted more interest because it can learn higher-order information and enhance the non-linear modeling ability of the networks. However, how to effectively learn second-order features and establish the discriminative feature representation of holistic images is still an open question. In this Letter, we propose a first and second-order information fusion networks (FSoI-Net) that can learn the first-order and secondorder features at the same time, and construct the final feature representation by fusing the two types of features. Specifically, a self-attention-based second-order pooling (SaSoP) method based on covariance matrix is proposed to extract second-order features, and a fusion loss function is developed to jointly train the model and construct the final feature representation for the classification decision. The proposed networks have been thoroughly evaluated on three real remote sensing scene datasets and achieved better performance than the counterparts.

show abstract

Section: B Experimental Results and Analysismentioning

confidence: 99%

First and Second-Order Information Fusion Networks for Remote Sensing Scene Classification

Samat

Zhang

et al. 2022

IEEE Geosci. Remote Sensing Lett.

View full text Add to dashboard Cite

show abstract

“…At the same time, we input the data set selected by our model into the newly published (Attention Consistent Network for Remote Sensing Scene Classification) [38] The models and algorithms in classification are mainly for high-precision image classification, and because the pixels of the model we used are too low and the similarity is very high, there is over fitting phenomenon when the model runs EuroSAT dataset. [42]and LEVIR (800 × 600) [43] are selected to test the model.…”

Section: B Analysis Of Experimental Resultsmentioning

confidence: 99%

A Lightweight Model of VGG-16 for Remote Sensing Image Classification

Chang

et al. 2021

IEEE J. Sel. Top. Appl. Earth Observations Remote Sensing

View full text Add to dashboard Cite

In planetary science, it is an important basic work to recognize and classify the features of topography and geomorphology from the massive data of planetary remote sensing.Therefore, this paper proposes a lightweight model based on VGG-16, which can selectively extract some features of remote sensing images, remove redundant information, and recognize and classify remote sensing images. This model not only ensures the accuracy, but also reduces the parameters of the model.According to our experimental results, our model has a great improvement in remote sensing image classification, from the original accuracy of 85% to 98% now. At the same time, the model has a great improvement in convergence speed and classification performance.By inputting the remote sensing image data of ultralow pixels (64 * 64) into our model, we prove that our model still has a high accuracy rate of 95% for the remote sensing image with ultra-low pixels and less feature points.Therefore, the model has a good application prospect in remote sensing image fine classification, very low pixel, less image classification.

show abstract

“…Attention is widely used for various tasks, such as machine translation [52], scene classification [53], and semantic segmentation [54]. The early attention mechanism was only designed to learn channel-wise correlations.…”

Section: B Self-attention Mechanismmentioning

confidence: 99%

Hierarchical Self-Attention Embedded Neural Network With Dense Connection for Remote-Sensing Image Semantic Segmentation

Xia

et al. 2021

IEEE Access

View full text Add to dashboard Cite

Semantic segmentation of remote-sensing imagery strives to assign a pixel-wise semantic label. Since encoder-decoder networks have demonstrated tremendous success in natural image semantic segmentation, the adoption and extension of this kind of method are transferring such superior performance for the problems in remote-sensing. Facing the high-altitude angle of imaging and complex and diverse ground objects of remote-sensing data, it is necessary to strengthen the features' distinguishability by enhancing the network's capability. Nevertheless, the existing methods suffer from the structural stereotype, leveraging the short-range and long-range contextual information insufficiently. Attempting to address the problems mentioned above, a hierarchical self-attention embedded neural network with dense connection for remote sensing image semantic segmentation (HSDCN) is proposed. In the encoder stage, multiple selfattention modules (SAM) are embedded to model pixel-wise and channel-wise relationships at various scales hierarchically, making the representations more refined and discriminative. Then the dense connections are used to fuse the heterogeneous features. Thus, the network could produce logical and reasonable clues for labeling pixels. The extensive experiments are conducted on ISPRS Vaihingen and Potsdam benchmarks. And the results reveal significant improvements in comparison with other state-ofthe-art methods.

show abstract

Attention Consistent Network for Remote Sensing Scene Classification

Cited by 120 publications

References 48 publications

First and Second-Order Information Fusion Networks for Remote Sensing Scene Classification

First and Second-Order Information Fusion Networks for Remote Sensing Scene Classification

A Lightweight Model of VGG-16 for Remote Sensing Image Classification

Hierarchical Self-Attention Embedded Neural Network With Dense Connection for Remote-Sensing Image Semantic Segmentation

Contact Info

Product

Resources

About