2021
DOI: 10.1109/jstars.2021.3117857
|View full text |Cite
|
Sign up to set email alerts
|

CSDS: End-to-End Aerial Scenes Classification With Depthwise Separable Convolution and an Attention Mechanism

Abstract: Compared with natural scenes, aerial scenes are usually composed of numerous objects densely distributed within the aerial view, and thus more key local semantic features are needed to describe them. However, when existing CNNs are used for remote sensing image classification, they typically focus on the global semantic features of the image, and especially for deep models, shallow and intermediate features are easily lost. This paper proposes a channel-spatial attention mechanism based on a depthwise separabl… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
18
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7

Relationship

2
5

Authors

Journals

citations
Cited by 31 publications
(18 citation statements)
references
References 59 publications
0
18
0
Order By: Relevance
“…They used two kinds of attention modules (channel and spatial attention modules) to explore the correlations between image pixels derived from the channel and spatial dimensions, respectively. Wang et al [13] combined a channel-spatial attention algorithm and DS-Conv to propose a lightweight network called CSDS for remote sensing image classification.…”
Section: Attention Mechanismsmentioning
confidence: 99%
See 1 more Smart Citation
“…They used two kinds of attention modules (channel and spatial attention modules) to explore the correlations between image pixels derived from the channel and spatial dimensions, respectively. Wang et al [13] combined a channel-spatial attention algorithm and DS-Conv to propose a lightweight network called CSDS for remote sensing image classification.…”
Section: Attention Mechanismsmentioning
confidence: 99%
“…Many classic networks, such as the Visual Geometry Group Network (VGGNet), AlexNet [9], and GoogLeNet [2], have demonstrated strong feature extraction capabilities and have been applied to remote sensing image classification. Many improved methods based on classic networks have also achieved state-of-the-art performance [10][11][12][13].…”
Section: Introductionmentioning
confidence: 99%
“…In FACNN, a supervised convolutional feature encoding module and a progressive aggregation strategy are proposed to aggregate intermediate features with semantic label information to achieve advanced classification accuracy. Wang et al 41 proposed a network for aerial scene classification based on deep separable convolution and spatial attention mechanism called CSDS. The model uses depth-separable convolution to extract features from channels and a residual pyramidal structure to connect and associate multiple layers of features with achieving state-of-the-art recognition accuracy.…”
Section: Classification Based On Depth Featuresmentioning
confidence: 99%
“…In FACNN, a supervised convolutional feature encoding module and a progressive aggregation strategy are proposed to aggregate intermediate features with semantic label information to achieve advanced classification accuracy. Wang et al 41 . proposed a network for aerial scene classification based on deep separable convolution and spatial attention mechanism called CSDS.…”
Section: Related Workmentioning
confidence: 99%
“…After the twenty-first century, with the continuous development and maturity of hyperspectral images (HSI) technology and related theories, it has broad application prospects in the field of grassland ecology 5 . Hyperspectral for parameter detection has the advantages of multiple bands, high sensitivity and non-destructiveness 6 , 7 . It facilitates grass classification with study at close range.…”
Section: Introductionmentioning
confidence: 99%