2013 IEEE Conference on Computer Vision and Pattern Recognition 2013
DOI: 10.1109/cvpr.2013.163
|View full text |Cite
|
Sign up to set email alerts
|

Rotation, Scaling and Deformation Invariant Scattering for Texture Discrimination

Abstract: An affine invariant representation is constructed with a cascade of invariants, which preserves information for classification. A joint translation and rotation invariant representation of image patches is calculated with a scattering transform. It is implemented with a deep convolution network, which computes successive wavelet transforms and modulus non-linearities. Invariants to scaling, shearing and small deformations are calculated with linear operators in the scattering domain. State-of-the-art classific… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
310
0

Year Published

2014
2014
2024
2024

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 360 publications
(329 citation statements)
references
References 16 publications
0
310
0
Order By: Relevance
“…Bruna et al (2013) define convolution operations over arbitrary graphs, generalizing from the typical grid of pixels to other locally connected structures. Sifre & Mallat (2013) extract representations that are invariant to affine transformations, based on scattering transforms. However, these representations are fixed (i.e., not learned from data), and not specifically tuned for the task at hand, unlike the representations learned by convolutional neural networks.…”
Section: Related Workmentioning
confidence: 99%
“…Bruna et al (2013) define convolution operations over arbitrary graphs, generalizing from the typical grid of pixels to other locally connected structures. Sifre & Mallat (2013) extract representations that are invariant to affine transformations, based on scattering transforms. However, these representations are fixed (i.e., not learned from data), and not specifically tuned for the task at hand, unlike the representations learned by convolutional neural networks.…”
Section: Related Workmentioning
confidence: 99%
“…Specifically, Xception's architecture is constructed based on a linear stack of a depth-wise separable convolution layer (i.e., 36 convolutional layers) with linear residual connections (see Figure 4). There are two important convolutional layers in this configuration: A depth-wise convolutional layer [37], where a spatial convolution is carried out independently in each channel of input data, and a pointwise convolutional layer, where a 1 × 1 convolutional layer maps the output channels to a new channel space using a depth-wise convolution. …”
Section: Xceptionmentioning
confidence: 99%
“…In this paper, we aim to improve the robustness of scene categorization by removing the distortions caused by arbitrary affine transformations. Because the (general) projective transformation can be locally approximated by affine transformations [6], the approach presented here constitutes a step towards developing a view-invariant scene categorization system.…”
Section: Introductionmentioning
confidence: 99%