2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022
DOI: 10.1109/cvpr52688.2022.01906
|View full text |Cite
|
Sign up to set email alerts
|

RFNet: Unsupervised Network for Mutually Reinforcing Multi-modal Image Registration and Fusion

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
21
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 55 publications
(21 citation statements)
references
References 30 publications
0
21
0
Order By: Relevance
“…Two-view corresponding is a fundamental problem in computer vision. It aims to establish sparse feature correspondences/matches between two-view images and estimate geometry relationship, serving as a premise for many complex vision problems such as structure from motion (Snavely, Seitz, and Szeliski 2008), simultaneous location and mapping (Mur-Artal, Montiel, and Tardos 2015), visual localization (Philbin et al 2010), and image fusion (Xu et al 2022). The most classical geometry matching pipeline starts from feature extraction and matching, and great efforts have been spent on handcrafted or learning-based detectors and descriptors, e.g., SIFT (Lowe 2004) and SuperPoint (DeTone, Malisiewicz, and Rabinovich 2018).…”
Section: Introductionmentioning
confidence: 99%
“…Two-view corresponding is a fundamental problem in computer vision. It aims to establish sparse feature correspondences/matches between two-view images and estimate geometry relationship, serving as a premise for many complex vision problems such as structure from motion (Snavely, Seitz, and Szeliski 2008), simultaneous location and mapping (Mur-Artal, Montiel, and Tardos 2015), visual localization (Philbin et al 2010), and image fusion (Xu et al 2022). The most classical geometry matching pipeline starts from feature extraction and matching, and great efforts have been spent on handcrafted or learning-based detectors and descriptors, e.g., SIFT (Lowe 2004) and SuperPoint (DeTone, Malisiewicz, and Rabinovich 2018).…”
Section: Introductionmentioning
confidence: 99%
“…10,29 Deep learning-based methods, such as RFNet, AT-GAN, and SemLA, utilize deep neural networks to promote multimodal image registration accuracy. 15,36,37 However, deep learning-based methods depend greatly on the quality of training data, which limits their performance in the registration of coral reefs with less texture.…”
Section: Introductionmentioning
confidence: 99%
“… 12 Generally, there are four types of automatic registration methods: intensity-based methods, feature-based methods, coarse-to-fine methods, and deep learning-based methods 13 15 Intensity-based methods, such as grayscale matching, match images by comparing reflection differences between pixels within correlation windows in images to be matched. These methods require remote sensing images with high consistency in terms of scale and orientation 9 .…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Moreover, there are many methods based on deep learning, such as Mu-net [18], PCNet [19], RFNet [20] and Fourier-Net [21]. In order to adapt various types of multimodal images, Mu-Net uses the structural similarity to design a loss function that allows Mu-net to achieve comprehensive and accurate registration [18].…”
Section: Introductionmentioning
confidence: 99%