2018
DOI: 10.1109/tpami.2017.2699184
|View full text |Cite
|
Sign up to set email alerts
|

DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs

Abstract: In this work we address the task of semantic image segmentation with Deep Learning and make three main contributions that are experimentally shown to have substantial practical merit. First, we highlight convolution with upsampled filters, or 'atrous convolution', as a powerful tool in dense prediction tasks. Atrous convolution allows us to explicitly control the resolution at which feature responses are computed within Deep Convolutional Neural Networks. It also allows us to effectively enlarge the field of v… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

39
12,830
0
32

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
4

Relationship

0
8

Authors

Journals

citations
Cited by 17,402 publications
(13,861 citation statements)
references
References 120 publications
39
12,830
0
32
Order By: Relevance
“…In each CRF iteration, the mutual interaction between pixels is calculated using the energy function [41]. The superpixel classification results obtained by the CNN are usually processed using the fully connected CRF.…”
Section: Fully Connected Crfmentioning
confidence: 99%
“…In each CRF iteration, the mutual interaction between pixels is calculated using the energy function [41]. The superpixel classification results obtained by the CNN are usually processed using the fully connected CRF.…”
Section: Fully Connected Crfmentioning
confidence: 99%
“…It is widely used for image semantic segmentation and patch-level labeling [11][12][13][14][15]18 by addressing computer vision problems with CRF inference. Kumar and Hebert 18 proposed the discriminative random field, which inherits the CRF concept for labeling man-made structures at patch level.…”
Section: Conditional Random Fieldmentioning
confidence: 99%
“…Owing to the need to solve excessive boundary smoothing for semantic segmentation using an adjacency CRF structure, Krähenbühl and Koltun 14 proposed a fully connected CRF that establishes pairwise potentials consisting of a linear combination of Gaussian kernels on all pairs of pixels in the image. Chen et al 15 proposed a DeepLab system that utilizes a fully connected CRF coupled with a deep convolutional network-based pixel-level classifier as well as long range dependencies to capture fine edge details. Yang and Yang 17 proposed a top-down saliency model by constructing a CRF upon SC of image patches; the codebook was optimized by jointly learning the CRF model.…”
Section: Conditional Random Fieldmentioning
confidence: 99%
See 1 more Smart Citation
“…However, traditional FCN methods have strong invariance but lack equivariance. Thus, even the state of the art methods such as Deeplab [24] cannot get clear and detailed labeling result of the rafts.…”
Section: Introductionmentioning
confidence: 99%