2023
DOI: 10.1109/jstars.2022.3229729
Attention-Aware Deep Feature Embedding for Remote Sensing Image Scene Classification

Abstract: Due to its wide range of applications, Remote Sensing (RS) image scene classification has attracted growing attention from researchers. With the development of the Convolutional Neural Network (CNN), CNN-based methods for RS image scene classification have made impressive progress. Most existing architectures consider only the global information of the RS images. However, the global information contains a large number of redundant areas that diminish the classification performance and i…
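The abstract's core idea, weighting informative spatial regions more heavily than redundant background before pooling a CNN feature map into an embedding, can be illustrated with a minimal sketch. This is not the authors' actual module; the saliency proxy (per-position channel energy) and the function name are assumptions chosen for illustration only.

```python
import numpy as np

def spatial_attention_pool(feat):
    """Pool a CNN feature map into an embedding, weighting each spatial
    position by a simple saliency proxy (per-position channel energy).

    feat: array of shape (C, H, W).
    Returns a (C,) embedding in which high-energy positions contribute
    more than low-energy (redundant) background positions.
    """
    C, H, W = feat.shape
    energy = (feat ** 2).sum(axis=0).reshape(-1)   # (H*W,) energy per position
    attn = np.exp(energy - energy.max())
    attn /= attn.sum()                              # softmax over spatial positions
    flat = feat.reshape(C, -1)                      # (C, H*W)
    return flat @ attn                              # attention-weighted pooling

rng = np.random.default_rng(0)
feat = rng.standard_normal((8, 4, 4))              # toy 8-channel 4x4 feature map
emb = spatial_attention_pool(feat)
print(emb.shape)                                    # (8,)
```

On a constant feature map the attention weights reduce to a uniform distribution, so the result equals plain average pooling; the weighting only departs from the mean when some positions carry more energy than others.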

Cited by 9 publications (9 citation statements)
References 68 publications
“…To further show the effect of our MAANet, we compare it with a set of state-of-the-art RSSC algorithms, covering traditional non-DL methods (i.e., BoVW, 7 IFK, 7 LDA, 7 LLC 8 ) that mainly rely on mid-level features and DL-based methods that are closely related to our network. Specifically, these DL models are subdivided into: (1) traditional CNNs (i.e., GoogLeNet, 7 CaffeNet, 7 VGG-VD-16, 7 and VGG-16-CapsNet 15 ); (2) gated networks (i.e., GBNet 18 and GBNet + global feature 18 ); (3) feature pyramid networks (i.e., EFPN-DSE-TDFF 19 and RANet 20 ); (4) global–local feature fusion networks (i.e., LCNN-BFF, 21 HABFNet, 22 MF2Net, 23 and DAFGCN 24 ); (5) attention-based networks (i.e., MS2AP, 25 MSA-Network, 26 SAFF, 27 ResNet50+EAM, 28 ACNet, 29 CSDS, 30 SEMSDNet, 31 ACR-MLFF, 32 CRAN, 33 and TDFE-DAA 34 ); and (6) currently popular transformers (i.e., ViT-B_32, 35 T2T-ViT-12, 36 V16_21k, 37 ViT, 35 PVT-V2-B0, 38 PiT-S, 39 Swin-T, 40 PVT-Medium, 41 and T-CNN 42 ). For a fair comparison, all results are obtained using the released source code or are provided directly by the authors.…”
Section: Experiments and Results
confidence: 99%
“…In recent works, many researchers have introduced attention into CNN-based RSSC, aiming to improve the RSSC performance. [25][26][27][28][29][30][31][32][33][34] For example, in Ref. 25 … Currently, many researchers have attempted to apply the above transformers to the RSSC task.…”
Section: Introduction
confidence: 99%
“…To fully verify the progress of our proposed method, we compared it with several state-of-the-art methods, including AlexNet [56], GoogleNet [34], CaffeNet [56], VGG-VD-16 [54], TEXNet [20], VGG16-CapsNet [57], VGG-VD-16-SAFF [58], ResNet-LGFFE [59], CSDS [60], MSRes-SplitNet [15], EFPN-DSE [49], TDFE-DAA [61], RANet [52], and EFPN-DSE-TDFF [49].…”
Section: Comparison With the State of the Art
confidence: 99%