2021
DOI: 10.5617/nmi.9131
|View full text |Cite
|
Sign up to set email alerts
|

Employing GRU to combine feature maps in DeeplabV3 for a better segmentation model

Abstract: In this paper, we aim to enhance the segmentation capabilities of DeeplabV3 by employing Gated Recurrent Neural Network (GRU). A 1-by-1 convolution in DeeplabV3 was replaced by GRU after the Atrous Spatial Pyramid Pooling (ASSP) layer to combine the input feature maps. The convolution and GRU have sharable parameters, though, the latter has gates that enable/disable the contribution of each input feature map. The experiments on unseen test sets demonstrate that employing GRU instead of convolution would produc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2025
2025

Publication Types

Select...
2
2
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(1 citation statement)
references
References 6 publications
0
1
0
Order By: Relevance
“…To compensate for the lost detailed information in high-level features, Gated Fully Fusion (GFF) [28] is used to selectively fuse features from multiple levels using gates in a fully connected way. To collect contextual information and produce relevant global information, Mahmood et al [29] modify SegNet architecture to embed Gated Recurrent Units (GRU) within the convolution layers. Meanwhile, multiple CRF-GRU layers [30] are injected into an FCN to model hierarchical contexts and show competitive results for semantic segmentation.…”
Section: Introductionmentioning
confidence: 99%
“…To compensate for the lost detailed information in high-level features, Gated Fully Fusion (GFF) [28] is used to selectively fuse features from multiple levels using gates in a fully connected way. To collect contextual information and produce relevant global information, Mahmood et al [29] modify SegNet architecture to embed Gated Recurrent Units (GRU) within the convolution layers. Meanwhile, multiple CRF-GRU layers [30] are injected into an FCN to model hierarchical contexts and show competitive results for semantic segmentation.…”
Section: Introductionmentioning
confidence: 99%