2019 IEEE/CVF International Conference on Computer Vision (ICCV) 2019
DOI: 10.1109/iccv.2019.00017
|View full text |Cite
|
Sign up to set email alerts
|

Jointly Aligning Millions of Images With Deep Penalised Reconstruction Congealing

Abstract: Extrapolating fine-grained pixel-level correspondences in a fully unsupervised manner from a large set of misaligned images can benefit several computer vision and graphics problems, e.g. co-segmentation, super-resolution, image edit propagation, structure-from-motion, and 3D reconstruction. Several joint image alignment and congealing techniques have been proposed to tackle this problem, but robustness to initialisation, ability to scale to large datasets, and alignment accuracy seem to hamper their wide appl… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
8
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(8 citation statements)
references
References 44 publications
0
8
0
Order By: Relevance
“…Figure 2 illustrates the STN for classification task, where X o is the original input image, E 1 is the spatial transformer block, M is the estimated alignment matrix, Xb is the aligned image, and E 2 is the classification network. The loss function is L = min ||y − ŷ|| 2 2 , where y and ŷ are the ground truth and predicted image class label.…”
Section: Background: Spatial Transformer Networkmentioning
confidence: 99%
See 1 more Smart Citation
“…Figure 2 illustrates the STN for classification task, where X o is the original input image, E 1 is the spatial transformer block, M is the estimated alignment matrix, Xb is the aligned image, and E 2 is the classification network. The loss function is L = min ||y − ŷ|| 2 2 , where y and ŷ are the ground truth and predicted image class label.…”
Section: Background: Spatial Transformer Networkmentioning
confidence: 99%
“…In adaptive Gabor convolutional networks [20], the convolutional kernels are adaptively multiplied by Gabor filters to achieve invariant information extracted from VOLUME 4, 2016 images. The spatial transformer network (STN) [4] is used in [2] to tackle the joint image alignment problem on larger datasets with higher variability. STN [4] can spatially transform the input images by embedding the spatial transformer block into a target network or algorithm.…”
mentioning
confidence: 99%
“…Several works [54,42,11,12] developed a similar idea for continuous parametric transformations in the simpler setting of image alignment, which was later applied again for clustering by [48,53,45,2]. Recently, Monnier et al [55] generalize these ideas to global alignments and large-scale datasets by leveraging neural networks to predict spatial alignments -implemented as spatial transformers [29] -, color transformations and morphological modifications.…”
Section: Related Workmentioning
confidence: 99%
“…Deep learning was recently used to scale the idea of congealing for global alignment of a single class of images [32] or time series [33]. Both works build on the idea of Spatial Transformer Networks [34] (STN) that spatial transformation are differentiable and can be learned by deep networks.…”
Section: Related Workmentioning
confidence: 99%