2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition 2018
DOI: 10.1109/cvpr.2018.00949
|View full text |Cite
|
Sign up to set email alerts
|

Detail-Preserving Pooling in Deep Networks

Abstract: Most convolutional neural networks use some method for gradually downscaling the size of the hidden layers. This is commonly referred to as pooling, and is applied to reduce the number of parameters, improve invariance to certain distortions, and increase the receptive field size. Since pooling by nature is a lossy process, it is crucial that each such layer maintains the portion of the activations that is most important for the network's discriminability. Yet, simple maximization or averaging over blocks, max… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
85
0
1

Year Published

2018
2018
2021
2021

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 119 publications
(97 citation statements)
references
References 23 publications
0
85
0
1
Order By: Relevance
“…Detail Preserving Pooling [36,45] is a recently proposed pooling layer that is useful to preserve high-frequency details when performing pooling in CNNs. PAC can model the detail-preserving pooling operations by incorporating an adapting kernel that emphasizes more distinct pixels in the neighborhood, e.g.,…”
Section: Pixel-adaptive Convolutionmentioning
confidence: 99%
See 1 more Smart Citation
“…Detail Preserving Pooling [36,45] is a recently proposed pooling layer that is useful to preserve high-frequency details when performing pooling in CNNs. PAC can model the detail-preserving pooling operations by incorporating an adapting kernel that emphasizes more distinct pixels in the neighborhood, e.g.,…”
Section: Pixel-adaptive Convolutionmentioning
confidence: 99%
“…We observe that PAC, despite being a simple modification to standard convolution, is highly flexible and can be seen as a generalization of several widely-used filters. Specifically, we show that PAC is a generalization of spatial convolution, bilateral filtering [2,42], and pooling operations such as average pooling and detail-preserving pooling [36]. We also implement a variant of PAC that does pixel-adaptive transposed convolution (also called deconvolution) which can be used for learnable guided upsampling of intermediate CNN representations.…”
Section: Introductionmentioning
confidence: 99%
“…First, the prior knowledge that the maximum activation stands for the most discriminative detail, may not be always true. Second, the max operator over sliding windows hinders gradient-based optimization since in the backpropagation gradients are assigned only to the local maximums, as discussed in [33]. These sparse gradients would further enhance this inconsistence, in sense that discriminative activations will never become maximums unless current maximums are suppressed.…”
Section: Framework and Analysismentioning
confidence: 99%
“…Detail-preserving pooling. Recent proposed detailpreserving pooling (DPP) [33] uses the detail criterion as importance function F , which is measured by the deviations of features from the activation statistics in sliding windows. DPP solves the problem of max pooling by designing more sophisticated importance function and ensuring the continuity for better gradient optimization.…”
Section: Framework and Analysismentioning
confidence: 99%
See 1 more Smart Citation