Detail-Preserving Pooling in Deep Networks

Saeedan, Faraz; Weber, Nicolas; Goesele, Michael; Roth, Stefan

doi:10.1109/cvpr.2018.00949

Cited by 119 publications

(97 citation statements)

References 23 publications


“…Detail Preserving Pooling [36,45] is a recently proposed pooling layer that is useful to preserve high-frequency details when performing pooling in CNNs. PAC can model the detail-preserving pooling operations by incorporating an adapting kernel that emphasizes more distinct pixels in the neighborhood, e.g.,…”

Section: Pixel-adaptive Convolution (mentioning)

confidence: 99%

“…We observe that PAC, despite being a simple modification to standard convolution, is highly flexible and can be seen as a generalization of several widely-used filters. Specifically, we show that PAC is a generalization of spatial convolution, bilateral filtering [2,42], and pooling operations such as average pooling and detail-preserving pooling [36]. We also implement a variant of PAC that does pixel-adaptive transposed convolution (also called deconvolution) which can be used for learnable guided upsampling of intermediate CNN representations.…”

Section: Introduction (mentioning)

confidence: 99%


Pixel-Adaptive Convolutional Neural Networks

Jampani

Sun

et al. 2019

2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)


Convolutions are the fundamental building blocks of CNNs. The fact that their weights are spatially shared is one of the main reasons for their widespread use, but it is also a major limitation, as it makes convolutions content-agnostic. We propose a pixel-adaptive convolution (PAC) operation, a simple yet effective modification of standard convolutions, in which the filter weights are multiplied with a spatially varying kernel that depends on learnable, local pixel features. PAC is a generalization of several popular filtering techniques and thus can be used for a wide range of use cases. Specifically, we demonstrate state-of-the-art performance when PAC is used for deep joint image upsampling. PAC also offers an effective alternative to fully-connected CRF (Full-CRF), called PAC-CRF, which performs competitively compared to Full-CRF, while being considerably faster. In addition, we also demonstrate that PAC can be used as a drop-in replacement for convolution layers in pre-trained networks, resulting in consistent performance improvements.
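The operation described in this abstract (filter weights multiplied by a spatially varying kernel computed from local pixel features) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: it assumes a single-channel input, zero padding, and a Gaussian adapting kernel exp(-0.5 * ||f_i - f_j||^2) chosen here for illustration. The function name `pixel_adaptive_conv2d` and its argument names are hypothetical.

```python
import numpy as np

def pixel_adaptive_conv2d(v, f, w):
    """Illustrative pixel-adaptive convolution (hypothetical helper).

    v: (H, W) input map.
    f: (H, W, C) guidance features that drive the adapting kernel.
    w: (k, k) spatially shared filter weights, k odd.
    Returns an (H, W) output, using zero padding at the borders.
    """
    H, W = v.shape
    k = w.shape[0]
    r = k // 2
    vp = np.pad(v, r)                          # zero-pad the input
    fp = np.pad(f, ((r, r), (r, r), (0, 0)))   # zero-pad the features
    out = np.zeros((H, W), dtype=float)
    for dy in range(k):
        for dx in range(k):
            v_shift = vp[dy:dy + H, dx:dx + W]
            f_shift = fp[dy:dy + H, dx:dx + W, :]
            # Assumed Gaussian adapting kernel: close to 1 for similar
            # features, close to 0 for dissimilar ones.
            kern = np.exp(-0.5 * np.sum((f - f_shift) ** 2, axis=-1))
            out += kern * w[dy, dx] * v_shift
    return out

# With all-zero guidance features the adapting kernel is 1 everywhere,
# so the result matches an ordinary zero-padded box filter.
v = np.ones((4, 4))
f = np.zeros((4, 4, 1))
w = np.full((3, 3), 1.0 / 9.0)
out = pixel_adaptive_conv2d(v, f, w)
```

In this sketch, constant guidance features reduce the operation to a standard spatially shared filter, which is one way to read the claim that PAC generalizes ordinary convolution.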


“…First, the prior knowledge that the maximum activation stands for the most discriminative detail, may not be always true. Second, the max operator over sliding windows hinders gradient-based optimization since in the backpropagation gradients are assigned only to the local maximums, as discussed in [33]. These sparse gradients would further enhance this inconsistence, in sense that discriminative activations will never become maximums unless current maximums are suppressed.…”

Section: Framework and Analysis (mentioning)

confidence: 99%

“…Detail-preserving pooling. Recent proposed detail-preserving pooling (DPP) [33] uses the detail criterion as importance function F, which is measured by the deviations of features from the activation statistics in sliding windows. DPP solves the problem of max pooling by designing more sophisticated importance function and ensuring the continuity for better gradient optimization.…”

Section: Framework and Analysis (mentioning)

confidence: 99%

“…For instance, LIP enables the network to preserve features of tiny targets while discarding false activations of the background clutter when recognizing or detecting small objects. Moreover, LIP is a more generic pooling method than the existing methods, in sense that it is capable of mimicking the behavior of average pooling, max pooling and detail-preserving pooling [33]. Experiments show LIP outperforms baseline methods by a large margin on ImageNet [8] with different architectures.…”

Section: Introduction (mentioning)

confidence: 99%
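The statements above describe pooling as a weighting of activations by an importance function F, and say that this view can mimic average pooling and max pooling. A minimal NumPy sketch of that idea follows. It assumes non-overlapping windows and a normalized weighted average; the helper name `importance_pool2d` and the exponential importance map used to approximate max pooling are illustrative choices, not the cited authors' code.

```python
import numpy as np

def importance_pool2d(x, importance, size=2):
    """Importance-weighted pooling over non-overlapping windows (hypothetical helper).

    x: (H, W) activations.
    importance: (H, W) positive weights, one per activation.
    Returns an (H // size, W // size) map of weighted window averages.
    """
    H, W = x.shape
    h, w = H // size, W // size
    # Split both maps into (h, size, w, size) blocks, one per window.
    xb = x[:h * size, :w * size].reshape(h, size, w, size)
    ib = importance[:h * size, :w * size].reshape(h, size, w, size)
    return (xb * ib).sum(axis=(1, 3)) / ib.sum(axis=(1, 3))

x = np.array([[1.0, 2.0, 0.0, 0.0],
              [3.0, 4.0, 0.0, 8.0]])

# Constant importance gives plain average pooling.
avg_like = importance_pool2d(x, np.ones_like(x))

# A sharply peaked importance (assumed exp(50 * x)) approaches max pooling.
max_like = importance_pool2d(x, np.exp(50.0 * x))
```

Because the weights stay positive and vary smoothly with the input, every activation in a window receives some gradient, in contrast to the sparse gradients of a hard max discussed in the quoted text.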


LIP: Local Importance-Based Pooling

Gao

Wang

2019

2019 IEEE/CVF International Conference on Computer Vision (ICCV)


Spatial downsampling layers are favored in convolutional neural networks (CNNs) to downscale feature maps for larger receptive fields and less memory consumption. However, for discriminative tasks, there is a possibility that these layers lose the discriminative details due to improper pooling strategies, which could hinder the learning process and eventually result in suboptimal models. In this paper, we present a unified framework over the existing downsampling layers (e.g., average pooling, max pooling, and strided convolution) from a local importance view. In this framework, we analyze the issues of these widely-used pooling layers and figure out the criteria for designing an effective downsampling layer. According to this analysis, we propose a conceptually simple, general, and effective pooling layer based on local importance modeling, termed as Local Importance-based Pooling (LIP). LIP can automatically enhance discriminative features during the downsampling procedure by learning adaptive importance weights based on inputs. Experiment results show that LIP consistently yields notable gains with different depths and different architectures on ImageNet classification. In the challenging MS COCO dataset, detectors with our LIP-ResNets as backbones obtain a consistent improvement (≥ 1.4%) over the vanilla ResNets, and especially achieve the current state-of-the-art performance in detecting small objects under the single-scale testing scheme.


Hyperspectral Band Selection with Convolutional Neural Network

Cai

Yuan

2018

Lecture Notes in Computer Science
