2016
DOI: 10.1007/978-3-319-45886-1_2
|View full text |Cite
|
Sign up to set email alerts
|

Pixel-Level Encoding and Depth Layering for Instance-Level Semantic Labeling

Abstract: Recent approaches for instance-aware semantic labeling have augmented convolutional neural networks (CNNs) with complex multitask architectures or computationally expensive graphical models. We present a method that leverages a fully convolutional network (FCN) to predict semantic labels, depth and an instance-based encoding using each pixel's direction towards its corresponding instance center. Subsequently, we apply low-level computer vision techniques to generate stateof-the-art instance segmentation on the… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
165
0

Year Published

2017
2017
2020
2020

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 161 publications
(167 citation statements)
references
References 37 publications
(102 reference statements)
2
165
0
Order By: Relevance
“…Here, cues from an external object detector are fused with a semantic segmentation output using a Conditional Random Field (CRF) in order to segment the semantic segmentation output into instances. In earlier work, methods like InstanceCut [17] and the work by Uhrig et al [18] solved the same task with single unified networks, also relying on postprocessing steps to split semantic segmentation predictions into instances. However, they are outperformed by DIN.…”
Section: Jsis-netmentioning
confidence: 99%
“…Here, cues from an external object detector are fused with a semantic segmentation output using a Conditional Random Field (CRF) in order to segment the semantic segmentation output into instances. In earlier work, methods like InstanceCut [17] and the work by Uhrig et al [18] solved the same task with single unified networks, also relying on postprocessing steps to split semantic segmentation predictions into instances. However, they are outperformed by DIN.…”
Section: Jsis-netmentioning
confidence: 99%
“…There has also been work on joint learning of semantic segmentation and object detection. In [16], the authors describe an approach to instance segmentation using multi-task learning. For each pixel they predict the class label, depth and the direction to the next instance center using a single neural network.…”
Section: Related Workmentioning
confidence: 99%
“…[38] built a CNN-based architecture to jointly reason pixel-wise instance level segmentation as well as depth order from multi-scale image patches and they combined predictions into the final labeling via the MRF. [35] used a fully convolutional network (FCN) to jointly predict pixel-level semantic labels, depths and the directions to object centers from a single street scene image.…”
Section: Depth Orderingmentioning
confidence: 99%
“…Indeed, every single objective of building an amodal perception system is not new and has long been studied separately or jointly in the community. There is plenty of literature visual understanding despite the presence of occlusion [11,30,34,36], depth ordering [26,35,38] and object completion and inpainting [7,19], that together show the feasibility and practicability of building machine vision systems with such capabilities. In this work, we try to solve all of these problems in a single amodal segmentation framework.…”
Section: Introductionmentioning
confidence: 99%