2021
DOI: 10.3390/rs13152986
|View full text |Cite
|
Sign up to set email alerts
|

Hybridizing Cross-Level Contextual and Attentive Representations for Remote Sensing Imagery Semantic Segmentation

Abstract: Semantic segmentation of remote sensing imagery is a fundamental task in intelligent interpretation. Since deep convolutional neural networks (DCNNs) performed considerable insight in learning implicit representations from data, numerous works in recent years have transferred the DCNN-based model to remote sensing data analysis. However, the wide-range observation areas, complex and diverse objects and illumination and imaging angle influence the pixels easily confused, leading to undesirable results. Therefor… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
8
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2
1

Relationship

2
6

Authors

Journals

citations
Cited by 18 publications
(15 citation statements)
references
References 41 publications
0
8
0
Order By: Relevance
“…AFNet [61] designs a multi-path encoder structure, multi-path attentionfused module and refinement attention-fused block for semantic segmentation of very high resolution remote sensing images HMANet [38] hybridizes three-level representations to augment learnt features, including space, channel and category. Similarly, Li et al proposed HCANet [62] that extracts and aggregates cross-level contextual information from pixels, superpixels, and global space. In addition, an adaptive fusion network was designed by Li et al [63], enhancing the feature points from either low-level or high-level feature maps with the constructed attention modules.…”
Section: B Semantic Segmentation Of Remote Sensing Imagesmentioning
confidence: 99%
“…AFNet [61] designs a multi-path encoder structure, multi-path attentionfused module and refinement attention-fused block for semantic segmentation of very high resolution remote sensing images HMANet [38] hybridizes three-level representations to augment learnt features, including space, channel and category. Similarly, Li et al proposed HCANet [62] that extracts and aggregates cross-level contextual information from pixels, superpixels, and global space. In addition, an adaptive fusion network was designed by Li et al [63], enhancing the feature points from either low-level or high-level feature maps with the constructed attention modules.…”
Section: B Semantic Segmentation Of Remote Sensing Imagesmentioning
confidence: 99%
“…The design of the attention mechanism is inspired by the human visual system. In application to the semantic segmentation of natural images, various attention mechanisms are introduced into models, which have achieved great performance in mining visual features [47,48]. In SENet, SE block was proposed to enhance the correlation of interchannel features using a global averaging pooling operation in the channel dimension [49].…”
Section: Attention Mechanismmentioning
confidence: 99%
“…In addition, the model's receptive field has a significant impact on its performance, and thus dilated convolution [14,15,17,34] and global pooling [35] have been introduced into the field of semantic segmentation, achieving significant performance improvement. At the same time, some models [13,32,36,37] leveraged attention mechanisms [16,31,[38][39][40][41][42] to capture long-distance dependency for semantic segmentation, which reach state-of-the-art performance. However, all of the aforementioned methods heavily rely on large-scale, accurate pixel-level labelled dataset.…”
Section: Semantic Segmentationmentioning
confidence: 99%