2020
DOI: 10.48550/arxiv.2009.06911
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Multi-scale Attention U-Net (MsAUNet): A Modified U-Net Architecture for Scene Segmentation

Abstract: Despite the growing success of Convolution neural networks (CNN) in the recent past in the task of scene segmentation, the standard models lack some of the important features that might result in sub-optimal segmentation outputs. The widely used encoder-decoder architecture extracts and uses several redundant and low-level features at different steps and different scales. Also, these networks fail to map the long-range dependencies of local features, which results in discriminative feature maps corresponding t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
3

Relationship

2
4

Authors

Journals

citations
Cited by 6 publications
(7 citation statements)
references
References 27 publications
0
7
0
Order By: Relevance
“…In this paper, we compared the proposed SegMarsViT with existing lightweight semantic segmentation methods. We evaluate SegMarsViT against eight SOTA natural image semantic segmentation methods, including FCN [10], DeepLabV3+ [50], Segmenter [51], PSPNet [52], PSANet [53], SegFormer [38], and FPN-PoolFormer [54].…”
Section: Comparison With Sota Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…In this paper, we compared the proposed SegMarsViT with existing lightweight semantic segmentation methods. We evaluate SegMarsViT against eight SOTA natural image semantic segmentation methods, including FCN [10], DeepLabV3+ [50], Segmenter [51], PSPNet [52], PSANet [53], SegFormer [38], and FPN-PoolFormer [54].…”
Section: Comparison With Sota Methodsmentioning
confidence: 99%
“…Long et al [6] first proposed a fully convolutional network (FCNet), which is a revolutionary work and the majority of following state-of-the-art (SOTA) studies are extensions of the FCN architecture. One of the most pioneering works is UNet presented by Ronneberger et al [7] for biomedical image segmentation, which adopts the influential encoder-decoder architecture and proved to be very useful for other types of image data [8][9][10][11]. Meanwhile, inspired by the high precision that CNNs achieved in semantic segmentation, many CNNs-based approaches were proposed for the Martian terrain segmentation (MTS) task.…”
Section: Introductionmentioning
confidence: 99%
“…Then, a variety of special information extraction structures are drawn according to this working mechanism to automatically learn and calculate the contribution of input data to the output data. Attention mechanisms have proven to be useful in fields such as scene segmentation [ 39 , 40 ], image understanding [ 41 , 42 ], fine-grained visual classification [ 43 , 44 ], and image inpainting [ 45 , 46 ].…”
Section: Related Workmentioning
confidence: 99%
“…Attention mechanisms are also widely known for boosting the performance of CNN-based models in different computer vision applications. Chattopadhyay et al [15] proposed a multi-scale attention mechanism which is inspired by the work of [9] for accurate localization and segmentation of objects. The dual attention mechanism was proposed by [19] adaptively integrates the local features with their corresponding global dependencies.…”
Section: Introductionmentioning
confidence: 99%