RPNet: An End-to-End Network for Relative Camera Pose Estimation

En, Sovann; Lechervy, Alexis; Jurie, Frédéric

doi:10.1007/978-3-030-11009-3_46

Cited by 34 publications (20 citation statements)

References 25 publications (35 reference statements)


“…The result shows that the change interval of β in the outdoor scene is between 250 and 2000. Using cross-validation, RPNet [7] found the most suitable hyperparameter β value in different locations, and spends lots of time clustering the original dataset and testing the trained model for the evaluation. For RCPNet, we use automatic weights that scale on the loss function based on homoscedastic uncertainty (as in [55]) across all the locations, which is numerically more stable than β.…”

Section: Learning Relative Translation and Rotation Simultaneously (mentioning)

confidence: 99%
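The two weighting schemes contrasted in this statement can be sketched as follows. This is a minimal illustration, not the code of RPNet or RCPNet: it assumes a PoseNet-style loss (Euclidean translation error plus a β-scaled error on the normalized quaternion) and the commonly used learned-uncertainty form in which each term is scaled by exp(-s) and regularized by s. The function names and the parameters `s_t` and `s_q` are hypothetical; in training they would be learnable scalars rather than fixed floats.

```python
import math

def l2(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def normalize(q):
    """Scale a quaternion to unit length."""
    n = math.sqrt(sum(x * x for x in q))
    return [x / n for x in q]

def beta_weighted_loss(t_pred, t_true, q_pred, q_true, beta):
    # Assumed PoseNet-style form: translation error + beta * rotation error.
    # beta is a hand-tuned hyperparameter (the statement reports 250 to 2000 outdoors).
    return l2(t_pred, t_true) + beta * l2(normalize(q_pred), q_true)

def uncertainty_weighted_loss(t_pred, t_true, q_pred, q_true, s_t, s_q):
    # Assumed homoscedastic-uncertainty form: each term is scaled by exp(-s)
    # and penalized by s, so the balance is learned instead of searched for.
    loss_t = l2(t_pred, t_true)
    loss_q = l2(normalize(q_pred), q_true)
    return loss_t * math.exp(-s_t) + s_t + loss_q * math.exp(-s_q) + s_q
```

With `s_t = s_q = 0` the second function reduces to the first with `beta = 1`, which makes the relationship between the two schemes easy to check.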

“…Different from RPNet [7] and PoseNet [11] based on GoogLeNet, we use two branches of pre-trained ResNet34 networks [57] to construct a weight-sharing Siamese network [56]. The 6DoF relative camera pose is estimated end-to-end.…”

Section: Architecture of RCPNet (mentioning)

confidence: 99%

“…An effective method is needed to produce image pairs to achieve relative camera pose estimation. For the Cambridge Landmarks dataset, En et al [7] randomly paired every image with eight images in the same sequence. The training sequences and testing sequences are separated beforehand.…”

Section: Real Image Pairs Preparation (mentioning)

confidence: 99%
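The pairing scheme described in this statement (every image paired with eight images from the same sequence) might look like the sketch below. The statement does not say whether partners are drawn with or without replacement or how short sequences are handled; this sketch assumes sampling without replacement, no self-pairs, and fewer partners when a sequence is too short. The function name `make_pairs` and the dictionary layout are hypothetical.

```python
import random

def make_pairs(sequences, pairs_per_image=8, seed=0):
    """Pair each image with up to `pairs_per_image` other images of its own sequence.

    `sequences` maps a sequence id to a list of image identifiers.
    Returns a list of (image, partner) tuples.
    """
    rng = random.Random(seed)  # fixed seed so the pairing is reproducible
    pairs = []
    for images in sequences.values():
        for img in images:
            # Candidates are all other images in the same sequence (assumption).
            others = [o for o in images if o != img]
            k = min(pairs_per_image, len(others))
            for partner in rng.sample(others, k):
                pairs.append((img, partner))
    return pairs
```

Because pairs are built per sequence, applying the function separately to the training and testing sequences keeps the two sets disjoint, as the statement requires.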

“…The images are rescaled to 256 × n or n × 256 pixels, n ≥ 256, and then are cropped into 224 × 224 patches as the input of CNN in the previous work [7,11]. The model is trained by random cropping and then tested by central cropping as data augmentation.…”

Section: Real Image Pairs Preparation (mentioning)

confidence: 99%
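The preprocessing in this statement (shorter side rescaled to 256, then a 224 × 224 crop, random for training and central for testing) can be sketched as pure geometry, without an image library. The helper names are hypothetical, and the rounding of the longer side is an assumption; the crop boxes use the (left, top, right, bottom) convention.

```python
import random

def resized_size(width, height, short_side=256):
    """Return (width, height) after scaling the shorter side to `short_side`."""
    scale = short_side / min(width, height)
    if width <= height:
        return short_side, max(short_side, round(height * scale))
    return max(short_side, round(width * scale)), short_side

def center_crop_box(width, height, crop=224):
    """Central crop box, as used at test time."""
    left = (width - crop) // 2
    top = (height - crop) // 2
    return left, top, left + crop, top + crop

def random_crop_box(width, height, crop=224, rng=random):
    """Random crop box, as used during training."""
    left = rng.randint(0, width - crop)  # randint is inclusive on both ends
    top = rng.randint(0, height - crop)
    return left, top, left + crop, top + crop
```

For a 1920 × 1080 input, `resized_size` gives 455 × 256, and the central crop box is `(115, 16, 339, 240)`.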

“…RCPNet is first compared with other learning-based pose estimation approaches PoseNet [11,55] and RPNet [7], using the real images from two datasets, namely, Tuebingen Buildings and Cambridge Landmarks. We then compare the accuracy between the real images and synthetic images trained RCPNet models on the two datasets.…”

Section: Introduction (mentioning)

confidence: 99%


Relative Camera Pose Estimation using Synthetic Data with Domain Adaptation via Cycle-Consistent Adversarial Networks

Yang; Liu; Zell. 2021. J Intell Robot Syst.

Learning-based visual localization has become prospective over the past decades. Since ground truth pose labels are difficult to obtain, recent methods try to learn pose estimation networks using pixel-perfect synthetic data. However, this also introduces the problem of domain bias. In this paper, we first build a Tuebingen Buildings dataset of RGB images collected by a drone in urban scenes and create a 3D model for each scene. A large number of synthetic images are generated based on these 3D models. We take advantage of image style transfer and cycle-consistent adversarial training to predict the relative camera poses of image pairs based on training over synthetic environment data. We propose a relative camera pose estimation approach to solve the continuous localization problem for autonomous navigation of unmanned systems. Unlike those existing learning-based camera pose estimation methods that train and test in a single scene, our approach successfully estimates the relative camera poses of multiple city locations with a single trained model. We use the Tuebingen Buildings and the Cambridge Landmarks datasets to evaluate the performance of our approach in a single scene and across-scenes. For each dataset, we compare the performance between real images and synthetic images trained models. We also test our model in the indoor dataset 7Scenes to demonstrate its generalization ability.

Associative3D: Volumetric Reconstruction from Sparse Views

Qian; Jin; Fouhey. 2020. Lecture Notes in Computer Science.

GeoGraph: Graph-Based Multi-view Object Detection with Geometric Cues End-to-End

Nassar; D’Aronco; Lefèvre; et al. 2020. Lecture Notes in Computer Science.

