SCAttNet: Semantic Segmentation Network With Spatial and Channel Attention Mechanism for High-Resolution Remote Sensing Images

Li, Haifeng; Qiu, Kaijian; Chen, Li; Mei, Xiaohong; Liang, Hong; Tao, Chao

doi:10.1109/lgrs.2020.2988294

Cited by 167 publications

(63 citation statements)

References 26 publications


“…The increasing number of HSR-RSIs enable building large-scale segmentation datasets that play an indispensable part in advance of semantic segmentation. In the past few years, several publicly available HSR-RSIs benchmark datasets have been proposed by different research groups for LUM of remote sensing images [26]-[35].…”

Section: Datasets for DL-based LUM (mentioning)

confidence: 99%


Land-Use Mapping for High-Spatial Resolution Remote Sensing Image Via Deep Learning: A Review

Zang, Cao, Wang, et al. 2021

IEEE J. Sel. Top. Appl. Earth Observations Remote Sensing


Land-use mapping (LUM) using high spatial resolution remote sensing images (HSR-RSIs) is a challenging and crucial technology. However, due to the characteristics of HSR-RSIs, such as varying image acquisition conditions and massive, detailed information, LUM faces unique scientific challenges. With the emergence of new deep learning (DL) algorithms in recent years, DL-based LUM methods have achieved major breakthroughs, which offer novel opportunities for the development of LUM for HSR-RSIs. This paper aims to provide a thorough review of recent achievements in this field. Existing high spatial resolution datasets for semantic segmentation and single-object segmentation are presented first. Next, we introduce several basic DL approaches that are frequently adopted for LUM. After comprehensively reviewing DL-based LUM methods and highlighting the contributions of researchers in the field of LUM for HSR-RSIs, we summarize these DL-based approaches according to two LUM criteria: the first is whether the learning is supervised, semi-supervised, or unsupervised, and the second is whether the method is pixel-based or object-based. We then briefly review the fundamentals and development of semantic segmentation and single-object segmentation. Finally, quantitative results on the ISPRS Vaihingen and ISPRS Potsdam datasets are given for several representative models such as FCN and U-Net, followed by a comparison and discussion of the results.


“…We split the labeled images of the Vaihingen dataset into a training dataset (12 images of ID 1,3,5,7,11,13,15,17,21,23,26,28) and a test dataset (4 images of ID 30,32,34,37). We randomly crop the training images into a size of 256 × 256 and flip and rotation images for data augmentation.…”

Section: Implementation Details (mentioning)

confidence: 99%
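The split and augmentation described in the statement above can be sketched roughly as follows. This is a minimal illustration, not the citing authors' code: the image IDs and the 256 × 256 crop size come from the quote, while the function names, the 0.5 flip probability, and the restriction to 90-degree rotations are assumptions made for the example. Images are plain nested lists so the sketch needs no third-party library.

```python
import random

# Image IDs and crop size as quoted in the statement above.
TRAIN_IDS = [1, 3, 5, 7, 11, 13, 15, 17, 21, 23, 26, 28]
TEST_IDS = [30, 32, 34, 37]
CROP = 256


def crop(grid, top, left, size):
    """Cut a size x size window out of a 2-D nested list."""
    return [row[left:left + size] for row in grid[top:top + size]]


def hflip(grid):
    """Mirror a 2-D nested list left to right."""
    return [row[::-1] for row in grid]


def rot90(grid):
    """Rotate a 2-D nested list by 90 degrees clockwise."""
    return [list(row) for row in zip(*grid[::-1])]


def augment(image, label, rng):
    """Apply one random crop, flip, and rotation identically to image and label.

    Assumption: flip probability 0.5 and rotations in multiples of 90 degrees;
    the quoted statement only says "flip and rotation".
    """
    top = rng.randint(0, len(image) - CROP)
    left = rng.randint(0, len(image[0]) - CROP)
    flip = rng.random() < 0.5
    turns = rng.randrange(4)
    patches = []
    for grid in (image, label):
        patch = crop(grid, top, left, CROP)
        if flip:
            patch = hflip(patch)
        for _ in range(turns):
            patch = rot90(patch)
        patches.append(patch)
    return patches[0], patches[1]
```

Drawing the crop offset, flip decision, and rotation count once and reusing them for both grids keeps every label pixel aligned with its image pixel, which matters for segmentation training.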

Land-Use Mapping for High-Spatial Resolution Remote Sensing Image Via Deep Learning: A Review

Zang, Cao, Wang, et al. 2021

IEEE J. Sel. Top. Appl. Earth Observations Remote Sensing


“…Misra et al. [47] proposed a method to compute the attention weights by capturing the cross-dimension interaction using a three-branch structure. In SCAttNet V2 [48], both channel and space attention modules were utilized to adaptively refine the extracted features of remote sensing images. In addition, by considering the geometrical structure information of remote sensing data, a spatial-spectral second-order attention module, a multiscale residual attention module, and a tensor attention module can produce satisfactory classification results in data fusion and classification [49], [50].…”

Section: Attention Mechanism (mentioning)

confidence: 99%
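To make "channel and space attention modules ... adaptively refine the extracted features" more concrete, here is a simplified, parameter-free sketch of the general idea. It is not the SCAttNet implementation: the learned layers such modules normally contain are replaced by a plain sum of pooled values, the channel-then-spatial order is an assumption, and all function names are hypothetical. The feature map is a nested list indexed as feat[channel][row][column].

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def channel_attention(feat):
    """Scale each channel by a gate computed from its average and max response."""
    out = []
    for channel in feat:
        values = [v for row in channel for v in row]
        avg_pool = sum(values) / len(values)
        max_pool = max(values)
        # Placeholder: a trained module would pass the pooled values through learned layers.
        gate = sigmoid(avg_pool + max_pool)
        out.append([[v * gate for v in row] for row in channel])
    return out


def spatial_attention(feat):
    """Scale each pixel by a gate computed from the across-channel average and max."""
    n_ch, height, width = len(feat), len(feat[0]), len(feat[0][0])
    gates = [[0.0] * width for _ in range(height)]
    for i in range(height):
        for j in range(width):
            column = [feat[c][i][j] for c in range(n_ch)]
            # Placeholder: a trained module would apply a learned convolution here.
            gates[i][j] = sigmoid(sum(column) / n_ch + max(column))
    return [[[feat[c][i][j] * gates[i][j] for j in range(width)]
             for i in range(height)] for c in range(n_ch)]


def refine(feat):
    """Apply channel attention, then spatial attention (order is an assumption)."""
    return spatial_attention(channel_attention(feat))
```

Both gates lie strictly between 0 and 1, so each stage reweights features without changing the shape of the feature map: channels and pixels with stronger responses are attenuated less than weak ones.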

“…Moreover, in the U-Net [10] and FCN 32s models, VGG-16 [52] is selected as the backbone. For RefineNet [38], Deeplabv3+ [37], PSPNet [17], SCAttNet V2 [48], ResUNet-a [14], and CBAM [56], ResNet50 [53] is used as the backbone, which is downsampled 32 times for RefineNet, 8 times for Deeplabv3+, 8 times for SCAttNet V2, 8 times for PSPNet, and 8 times for ResUNet-a, while 8 times for CBAM. In DSPCANet (IRRG), only the IRRG images are considered.…”

Section: B Implementation Details (mentioning)

confidence: 99%
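The downsampling factors quoted above translate directly into the spatial size of the backbone's output feature map. The arithmetic can be sketched as below; the factors are taken from the quote, whereas the 512-pixel input side and the function name are hypothetical choices for illustration.

```python
# Downsampling factors as listed in the quoted statement.
DOWNSAMPLING = {
    "RefineNet": 32,
    "Deeplabv3+": 8,
    "SCAttNet V2": 8,
    "PSPNet": 8,
    "ResUNet-a": 8,
    "CBAM": 8,
}


def feature_map_side(input_side, factor):
    """Side length of the backbone output for a square input.

    Assumes input_side is divisible by factor; real networks may pad or round.
    """
    return input_side // factor


INPUT_SIDE = 512  # hypothetical input size, not stated in the quote
for name, factor in DOWNSAMPLING.items():
    side = feature_map_side(INPUT_SIDE, factor)
    print(f"{name}: {INPUT_SIDE}x{INPUT_SIDE} -> {side}x{side}")
```

A smaller factor keeps a denser feature map, which preserves more spatial detail at the cost of more computation per layer.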

DSPCANet: Dual-Channel Scale-Aware Segmentation Network With Position and Channel Attentions for High-Resolution Aerial Images

et al. 2021

IEEE J. Sel. Top. Appl. Earth Observations Remote Sensing


“…Unlike previous works that capture contexts by multi-scale feature fusion, Dual Attention Network (DANet) [19] adaptively integrate local features with their global dependencies, which model the semantic interdependencies in spatial and channel dimensions respectively. SCAttNet [20] proposed by Haifeng Li combines spatial attention with channel attention to segment remote sensing imagery. In summary, the following problems still exist when using deep learning networks to segment remote sensing imagery:…”

Section: Introduction (mentioning)

confidence: 99%

Adaptive Filtering Remote Sensing Image Segmentation Network based on Attention Mechanism

Wu, Dong, Lin, et al. 2021

Computer Science & Information Technology (CS & IT)


It is difficult to segment small objects and object edges in remote sensing imagery because of large scale variation, large intra-class variance of the background, and foreground-background imbalance. In convolutional neural networks, high-frequency signals may degenerate into completely different ones after downsampling. We define this phenomenon as aliasing. Meanwhile, although dilated convolution can expand the receptive field of a feature map, a much more complex background can cause serious alarms. To alleviate these problems, we propose an attention-based adaptive filtering segmentation network. Experimental results on the DeepGlobe Road Extraction dataset and the Inria Aerial Image Labeling dataset show that our method can effectively improve segmentation accuracy. The F1 values on the two datasets reached 82.67% and 85.71%, respectively.
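The aliasing effect mentioned in the abstract can be reproduced with a one-dimensional toy signal. The sketch below is only an illustration of the phenomenon, not part of the cited method; the frequencies and names are chosen for the example. A cosine at 0.375 cycles per sample is downsampled by a stride of 2 with no low-pass filter, and the result is numerically identical to a slower cosine at 0.25 cycles per sample.

```python
import math


def sample(freq, n):
    """Return n samples of a cosine with `freq` cycles per sample."""
    return [math.cos(2 * math.pi * freq * k) for k in range(n)]


high = sample(0.375, 32)   # fine detail, still below the Nyquist limit of 0.5
down = high[::2]           # stride-2 downsampling without a low-pass filter
alias = sample(0.25, 16)   # a slower cosine, sampled at the downsampled rate

# After downsampling, the fast signal cannot be told apart from the slow one.
max_diff = max(abs(a - b) for a, b in zip(down, alias))
print(f"largest difference: {max_diff:.2e}")
```

After striding, the original frequency becomes 0.75 cycles per sample, which exceeds the new Nyquist limit of 0.5 and folds back to 0.25. Filtering out the high frequencies before downsampling is the usual way to avoid this fold-back.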
