2019 IEEE/CVF International Conference on Computer Vision (ICCV) 2019
DOI: 10.1109/iccv.2019.00353
|View full text |Cite
|
Sign up to set email alerts
|

Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks With Octave Convolution

Abstract: In natural images, information is conveyed at different frequencies where higher frequencies are usually encoded with fine details and lower frequencies are usually encoded with global structures. Similarly, the output feature maps of a convolution layer can also be seen as a mixture of information at different frequencies. In this work, we propose to factorize the mixed feature maps by their frequencies, and design a novel Octave Convolution (OctConv) operation 1 to store and process feature maps that vary sp… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

3
299
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 473 publications
(302 citation statements)
references
References 42 publications
3
299
0
Order By: Relevance
“…Recently, there are some concurrent works aiming at improving the performance by utilizing the multi-scale features [5], [9], [11], [49]. Big-Little Net [5] is a multi-branch network composed of branches with different computational complexity.…”
Section: Concurrent Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Recently, there are some concurrent works aiming at improving the performance by utilizing the multi-scale features [5], [9], [11], [49]. Big-Little Net [5] is a multi-branch network composed of branches with different computational complexity.…”
Section: Concurrent Workmentioning
confidence: 99%
“…Big-Little Net [5] is a multi-branch network composed of branches with different computational complexity. Octave Conv [9] decomposes the standard convolution into two resolutions to process features at different frequencies. MSNet [11] utilizes a high-resolution network to learn high-frequency residuals by using the up-sampled low-resolution features learned by a lowresolution network.…”
Section: Concurrent Workmentioning
confidence: 99%
See 1 more Smart Citation
“…An octave convolutional layer [1] factorizes the output feature maps of a convolutional layer into two groups. The resolution of the low-frequency feature maps is reduced by an octaveheight and width dimensions are divided by 2.…”
Section: Multi-scale Octave Convolutionsmentioning
confidence: 99%
“…
We propose a multi-scale octave convolution layer to learn robust speech representations efficiently. Octave convolutions were introduced by Chen et al [1] in the computer vision field to reduce the spatial redundancy of the feature maps by decomposing the output of a convolutional layer into feature maps at two different spatial resolutions, one octave apart. This approach improved the efficiency as well as the accuracy of the CNN models.
…”
mentioning
confidence: 99%