Procedings of the British Machine Vision Conference 2017 2017
DOI: 10.5244/c.31.181
|View full text |Cite
|
Sign up to set email alerts
|

Residual Conv-Deconv Grid Network for Semantic Segmentation

Abstract: This paper presents GridNet, a new Convolutional Neural Network (CNN) architecture for semantic image segmentation (full scene labelling). Classical neural networks are implemented as one stream from the input to the output with subsampling operators applied in the stream in order to reduce the feature maps size and to increase the receptive field for the final prediction. However, for semantic image segmentation, where the task consists in providing a semantic class to each pixel of an image, feature maps red… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
108
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 187 publications
(108 citation statements)
references
References 17 publications
0
108
0
Order By: Relevance
“…Maintaining high-resolution representations. Our work is closely related to several works that can also generate highresolution representations, e.g., convolutional neural fabrics [98], interlinked CNNs [150], GridNet [29], and multiscale DenseNet [43].…”
Section: Related Workmentioning
confidence: 98%
See 1 more Smart Citation
“…Maintaining high-resolution representations. Our work is closely related to several works that can also generate highresolution representations, e.g., convolutional neural fabrics [98], interlinked CNNs [150], GridNet [29], and multiscale DenseNet [43].…”
Section: Related Workmentioning
confidence: 98%
“…The two early works, convolutional neural fabrics [98] and interlinked CNNs [150], lack careful design on when to start low-resolution parallel streams, and how and where to exchange information across parallel streams, and do not use batch normalization and residual connections, thus not showing satisfactory performance. GridNet [29] is like a combination of multiple U-Nets and includes two symmetric information exchange stages: the first stage passes information only from high resolution to low resolution, and the second stage passes information only from low resolution to high resolution. This limits its segmentation quality.…”
Section: Related Workmentioning
confidence: 99%
“…The exploration of aggregating hierarchical feature has recently been the subject of research. Fourure et al [45] propose GridNet, which is an encoder-decoder architecture wherein the feature maps are wired in a grid fashion, generalizing several classical segmentation architectures. Despite GridNet contains multiple streams with different resolutions, it lacks up-sampling layers between skip connections; and thus, it does not represent UNet++.…”
Section: B Feature Aggregationmentioning
confidence: 99%
“…3. Attention-based multi-scale estimation: Inspired by [7], we implement multi-scale estimation on a grid network. The grid network has clear advantages over the encoderdecoder network and the conventional multi-scale network extensively used in image restoration [18,41,38,27].…”
Section: Trainable Pre-processing Modulementioning
confidence: 99%