2018 IEEE Winter Conference on Applications of Computer Vision (WACV) 2018
DOI: 10.1109/wacv.2018.00153
|View full text |Cite
|
Sign up to set email alerts
|

DARN: A Deep Adversarial Residual Network for Intrinsic Image Decomposition

Abstract: We present a new deep supervised learning method for intrinsic decomposition of a single image into its albedo and shading components. Our contributions are based on a new fully convolutional neural network that estimates absolute albedo and shading jointly. Our solution relies on a single end-to-end deep sequence of residual blocks and a perceptually-motivated metric formed by two adversarially trained discriminators.As opposed to classical intrinsic image decomposition work, it is fully data-driven, hence do… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

1
68
0

Year Published

2018
2018
2019
2019

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 63 publications
(69 citation statements)
references
References 26 publications
1
68
0
Order By: Relevance
“…In addition, [15] proposes a model by introducing inter-links between decoder modules, based on the expectation that intrinsic components are correlated. Moreover, [16] demonstrates the capability of generative adversarial networks for the task. On the other hand, in more recent work, [17] considers an image formation loss together with gradient supervision to steer the learning process to achieve more vivid colors and sharper edges.…”
Section: Related Workmentioning
confidence: 97%
See 1 more Smart Citation
“…In addition, [15] proposes a model by introducing inter-links between decoder modules, based on the expectation that intrinsic components are correlated. Moreover, [16] demonstrates the capability of generative adversarial networks for the task. On the other hand, in more recent work, [17] considers an image formation loss together with gradient supervision to steer the learning process to achieve more vivid colors and sharper edges.…”
Section: Related Workmentioning
confidence: 97%
“…Baseline network architecture: The encoder part is composed of 6 convolution blocks with 3 × 3 kernels and stride of 2 having[16,32, 64, 128, 256, 256] feature maps. The encoder part is mirrored to build the decoder.…”
mentioning
confidence: 99%
“…Intrinsic image decomposition is a sub-problem of inverse rendering, where a single image is decomposed into albedo and shading. Recent methods learn intrinsic image decomposition from labeled synthetic data [17,26,34] and from unlabeled [20] or partially labeled real data [49,19,28,2]. Intrinsic image decomposition methods do not explicitly recover geometry or illumination but rather combine them together as shading.…”
Section: Related Workmentioning
confidence: 99%
“…Direct Intrinsics [37] is a successful early example of this type, using a (back then seemingly bold) multilayer CNN architecture to transform an image directly into shading and albedo. Successive models include the work of Kim et al [25] that predicts depth and the other intrinsic components together with a joint convolutional network that has shared intermediate layers and a joint CRF loss, and the DARN [32] network that incorporates a discriminator network and the adversarial training scheme to enhance of the performance of a "generator" network that produces the decomposition results. Image scale space and pyramid structures: The investigation of image scale space is no less old-fashioned than that of the intrinsic image decomposition in vision.…”
Section: Intrinsic Imagesmentioning
confidence: 99%
“…While models of similar ideas have been proposed (e.g. [37,32]), our model explores the scale space of the network input and output, and considers to simply the transformation as a whole by horizontally expanding the functor approximation pipeline into a parallel set of sub-band transformations.…”
Section: Introductionmentioning
confidence: 99%