2019
DOI: 10.1007/s11263-019-01188-y
|View full text |Cite
|
Sign up to set email alerts
|

Self-Supervised Model Adaptation for Multimodal Semantic Segmentation

Abstract: Learning to reliably perceive and understand the scene is an integral enabler for robots to operate in the realworld. This problem is inherently challenging due to the multitude of object types as well as appearance changes caused by varying illumination and weather conditions. Leveraging complementary modalities can enable learning of semantically richer representations that are resilient to such perturbations. Despite the tremendous progress in recent years, most multimodal convolutional neural network appro… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
139
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
5
4
1

Relationship

0
10

Authors

Journals

citations
Cited by 200 publications
(157 citation statements)
references
References 62 publications
1
139
0
Order By: Relevance
“…In fact, different fusion techniques can be considered: early fusion, late fusion or multi-level fusion. Valada et al [19] adopt the latter technique by extracting and combining feature maps at different stages in the encoder from multiple input streams. In general most works, such as [7,2], show that late fusion can achieve better performance.…”
Section: Fusionmentioning
confidence: 99%
“…In fact, different fusion techniques can be considered: early fusion, late fusion or multi-level fusion. Valada et al [19] adopt the latter technique by extracting and combining feature maps at different stages in the encoder from multiple input streams. In general most works, such as [7,2], show that late fusion can achieve better performance.…”
Section: Fusionmentioning
confidence: 99%
“…Cityscapes data Additional data mIoU (%) Runtime (s) DRN_CRL_Coarse [37] Fine, Coarse ImagNet 82.8 -DPC [3] Fine, Coarse ImageNet, COCO [17] 82.7 -RelationNet_Coarse [36] Fine, Coarse ImageNet 82.4 -SSMA [32] Fine…”
Section: High Accuracy Network Methodsmentioning
confidence: 99%
“…ICNet [28] achieves great balance between efficiency and accuracy by using a hierarchical structure to save time on high-resolution feature maps. As regards RGB-D semantic segmentation, some studies have tried to utilizing the depth information to achieve better segmentation accuracy [9,13,18,25]. Hazirbas [9] presented a fusion-based CNN architecture which is consisted of two encoder branches for RGB and depth channel.…”
Section: Semantic Segmentationmentioning
confidence: 99%