2017
DOI: 10.48550/arxiv.1707.07958
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Residual Conv-Deconv Grid Network for Semantic Segmentation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
59
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 38 publications
(59 citation statements)
references
References 0 publications
0
59
0
Order By: Relevance
“…3D CNN networks are the natural choices to capture spatiotemporal features among video frames. However, the existing architectures for pixel-wise tasks (e.g., UNet-3D [26]) adopt a singlestream Encoder-Decoder style architecture that aggregates multi-scale features by the process of sequential downsampling and skip-connection which may result in information loss [13]. Inspired by the success of GridNet [14,36] in efficiently incorporating multi-resolution features, we formulate a novel 3D version of GridNet namely "GridNet-3D" by replacing its 2D convolutional filters with 3D convolutional filters.…”
Section: Non-linear Motion Estimation (Nme) Modulementioning
confidence: 99%
See 4 more Smart Citations
“…3D CNN networks are the natural choices to capture spatiotemporal features among video frames. However, the existing architectures for pixel-wise tasks (e.g., UNet-3D [26]) adopt a singlestream Encoder-Decoder style architecture that aggregates multi-scale features by the process of sequential downsampling and skip-connection which may result in information loss [13]. Inspired by the success of GridNet [14,36] in efficiently incorporating multi-resolution features, we formulate a novel 3D version of GridNet namely "GridNet-3D" by replacing its 2D convolutional filters with 3D convolutional filters.…”
Section: Non-linear Motion Estimation (Nme) Modulementioning
confidence: 99%
“…However, the existing architectures for pixel-wise tasks (e.g., UNet-3D [26]) adopt a singlestream Encoder-Decoder style architecture that aggregates multi-scale features by the process of sequential downsampling and skip-connection which may result in information loss [13]. Inspired by the success of GridNet [14,36] in efficiently incorporating multi-resolution features, we formulate a novel 3D version of GridNet namely "GridNet-3D" by replacing its 2D convolutional filters with 3D convolutional filters. GridNet-3D consists of three parallel streams to capture features with different resolutions and each stream has five convolutional blocks arranged in a sequence as shown in Fig.…”
Section: Non-linear Motion Estimation (Nme) Modulementioning
confidence: 99%
See 3 more Smart Citations