2021
DOI: 10.48550/arxiv.2106.07512
Preprint

Last Layer Marginal Likelihood for Invariance Learning

Abstract: Data augmentation is often used to incorporate inductive biases into models. Traditionally, these are hand-crafted and tuned with cross validation. The Bayesian paradigm for model selection provides a path towards end-to-end learning of invariances using only the training data, by optimising the marginal likelihood. We work towards bringing this approach to neural networks by using an architecture with a Gaussian process in the last layer, a model for which the marginal likelihood can be computed. Experimental…
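The mechanism sketched in the abstract (make the last layer a Gaussian process, then score candidate invariances by its marginal likelihood) can be illustrated with a small, self-contained example. Everything below is an assumption for illustration only, not the paper's implementation: the rotation family, the kernel averaging, the function names, and the toy data are all made up, and the sketch uses a plain GP on the inputs rather than a GP last layer on neural features. It shows the general recipe of averaging a base kernel over sampled transformations and comparing invariance settings by the GP log marginal likelihood.

```python
# Minimal illustrative sketch (assumed names and setup, not the paper's code):
# average an RBF kernel over sampled 2-D rotations and score the resulting
# invariance range with the Gaussian-process log marginal likelihood.
import numpy as np

def rbf_kernel(A, B, lengthscale=1.0):
    """Squared-exponential kernel between row vectors of A and B."""
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-0.5 * d2 / lengthscale ** 2)

def rotate(X, angles):
    """Rotate 2-D inputs X (N, 2) by each angle in `angles` (radians)."""
    c, s = np.cos(angles), np.sin(angles)
    R = np.stack([np.stack([c, -s], -1), np.stack([s, c], -1)], -2)  # (A, 2, 2)
    return np.einsum('aij,nj->ani', R, X)                            # (A, N, 2)

def invariant_kernel(X, max_angle, n_samples=8, rng=None):
    """Kernel made approximately rotation-invariant (up to max_angle)
    by averaging the base kernel over sampled transformations."""
    rng = rng or np.random.default_rng(0)
    angles = rng.uniform(-max_angle, max_angle, size=n_samples)
    Xg = rotate(X, angles)
    K = np.zeros((X.shape[0], X.shape[0]))
    for a in range(n_samples):
        for b in range(n_samples):
            K += rbf_kernel(Xg[a], Xg[b])
    return K / n_samples ** 2

def log_marginal_likelihood(K, y, noise=0.1):
    """log p(y | X) for GP regression with Gaussian observation noise."""
    n = len(y)
    L = np.linalg.cholesky(K + noise ** 2 * np.eye(n))
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y))
    return (-0.5 * y @ alpha
            - np.log(np.diag(L)).sum()
            - 0.5 * n * np.log(2 * np.pi))

# Toy usage: compare the evidence assigned to different invariance ranges
# on a target that happens to be rotation-invariant.
rng = np.random.default_rng(1)
X = rng.normal(size=(60, 2))
y = np.linalg.norm(X, axis=1)
for max_angle in [0.0, np.pi / 4, np.pi]:
    lml = log_marginal_likelihood(invariant_kernel(X, max_angle), y)
    print(f"max_angle={max_angle:.2f}  log marginal likelihood={lml:.2f}")
```

In this kind of setup the invariance parameter (here `max_angle`) is treated like any other kernel hyperparameter, so in principle it can be selected, or optimised by gradient methods, using only the training data rather than a held-out validation set.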

Cited by 1 publication (2 citation statements)
References 11 publications
“…In [4] and [8] symmetries are selected using a regularized training loss directly. Many symmetry discovery methods focus on learning invariances [27,4,22,26,10], which are easier to parameterize. This work offers a way to parameterize continuous equivariance constraints, which could allow extensions of symmetry discovery approaches to learnable equivariances.…”
Section: Related Work
confidence: 99%
“…Secondly, automatically learning symmetry structure from data is an interesting problem. Work in this field often focuses on invariances [27,4,26,22,10], which are easier to parameterize than equivariances. Parameterizations that allow smooth and adjustable symmetry constraints could extend such methods to layer-by-layer learnable equivariances.…”
Section: Introduction
confidence: 99%