2021
DOI: 10.1016/j.imavis.2021.104309
|View full text |Cite
|
Sign up to set email alerts
|

Attention-guided chained context aggregation for semantic segmentation

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 29 publications
(6 citation statements)
references
References 12 publications
0
4
0
Order By: Relevance
“…Then, we compare the proposed method with state-of-thearts methods (IEMNet [60], EMSANet [67], RAFNet [64], ESANet [24], RedNet [8], ACNet [23], SGNet [27], CANet [68], RDFNet [7], ShapeConv [66]) on the SUN RGB-D dataset. As depicted in Table 2, our approach consistently achieves a higher mIoU score on the SUN RGB-D dataset Visual comparisons on the NYU-Depth V2 dataset.…”
Section: Quantitative Experimental Results On Nyu-depth V2 and Sun Rg...mentioning
confidence: 99%
“…Then, we compare the proposed method with state-of-thearts methods (IEMNet [60], EMSANet [67], RAFNet [64], ESANet [24], RedNet [8], ACNet [23], SGNet [27], CANet [68], RDFNet [7], ShapeConv [66]) on the SUN RGB-D dataset. As depicted in Table 2, our approach consistently achieves a higher mIoU score on the SUN RGB-D dataset Visual comparisons on the NYU-Depth V2 dataset.…”
Section: Quantitative Experimental Results On Nyu-depth V2 and Sun Rg...mentioning
confidence: 99%
“…Although the upsampling operation based on bilinear interpolation [15] and nearest neighbor interpolation [16] can capture and restore the features extracted by the convolutional layer to a certain extent, its process does not consider the difference between each predicted pixel. Correlation, such as weak data-dependent convolutional decoders [17], cannot produce relatively high-quality feature maps. In this paper, the DUpsampling structure based on data correlation is added to the features extracted by the 3D-UNet network reconstruction encoding path so that the obtained feature map has better expressive ability.…”
Section: Dupsampling Structurementioning
confidence: 99%
“…The need for semantic segmentation in the context of Cityscapes emerges [3] from the need to glean useful insights from large-scale urban photographs and films. This necessitates the use of such technology.…”
Section: Introductionmentioning
confidence: 99%
“…([d19, f14]) f21 = Conv2D(c20, 512,(3,3), "same", 1, "relu") b21 = BatchNormalization(f21) f22 = Conv2D(b21, 512,(3,3), "same", 1, "relu") # Second Upsample m23 = UpSample(f22, size = (2, 2)) d23 = Dropout(m23, 0.2)PLOS ONE c24 = Concatenate([d23, f10]) f25 = Conv2D(c24, 256,(3,3), "same", 1, "relu") b25 = BatchNormalization(f25) f26 = Conv2D(b25, 256,(3,3), "same", 1, "relu") # Third Upsample m27 = UpSample(f26, size = (2, 2)) d27 = Dropout(m27, 0.2) c28 = Concatenate([d27, f6]) f29 = Conv2D(c28, 128,(3,3), "same", 1, "relu") b29 = BatchNormalization(f29) f30 = Conv2D(b29, 128, (3, 3), "same", 1, "relu") # Fourth Upsample m31 = UpSample(f30, size = (2, 2)) d31 = Dropout(m31, 0.2) c32 = Concatenate([d31, f2]) f33 = Conv2D(c32, 64, (3, 3), "same", 1, "relu") b33 = BatchNormalization(f33) f34 = Conv2D(b33, 64,(3,3), "same", 1, "relu")…”
mentioning
confidence: 99%