2022
DOI: 10.48550/arxiv.2209.09050
Preprint

Loc-NeRF: Monte Carlo Localization using Neural Radiance Fields

Abstract: We present Loc-NeRF, a real-time vision-based robot localization approach that combines Monte Carlo localization and Neural Radiance Fields (NeRF). Our system uses a pre-trained NeRF model as the map of an environment and can localize itself in real-time using an RGB camera as the only exteroceptive sensor onboard the robot. While neural radiance fields have seen significant applications for visual rendering in computer vision and graphics, they have found limited use in robotics. Existing approaches for NeRF-…
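As a rough illustration of the approach described in the abstract, the sketch below shows how a particle filter (Monte Carlo localization) could use a pre-trained NeRF as its measurement model: particles are propagated by odometry, weighted by the photometric agreement between the live RGB image and a NeRF rendering at each particle's pose, and then resampled. The `render_fn` interface and the noise scales are assumptions for illustration, not Loc-NeRF's actual implementation.

```python
import numpy as np

def mcl_update(particles, odom_delta, observed_image, render_fn, motion_noise=0.05):
    """One predict/update/resample step of Monte Carlo localization.

    particles : (N, 6) array of camera poses (x, y, z, roll, pitch, yaw)
    render_fn : callable mapping a pose to an RGB image rendered by a
                pre-trained NeRF (hypothetical interface)
    """
    # Predict: propagate each particle by the odometry increment plus noise.
    particles = particles + odom_delta + np.random.normal(
        scale=motion_noise, size=particles.shape)

    # Update: weight each particle by photometric agreement between the
    # camera image and the NeRF rendering at the particle's pose.
    weights = np.empty(len(particles))
    for i, pose in enumerate(particles):
        rendered = render_fn(pose)
        error = np.mean((rendered - observed_image) ** 2)
        weights[i] = np.exp(-error / 0.1)  # illustrative noise scale
    weights /= weights.sum()

    # Pose estimate: weighted mean of the particle set.
    estimate = np.average(particles, axis=0, weights=weights)

    # Resample: draw particles proportionally to their weights.
    idx = np.random.choice(len(particles), size=len(particles), p=weights)
    return particles[idx], estimate
```

In practice, rendering only a small subset of pixels per particle is a common way to keep such an update affordable in real time.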

Cited by 3 publications (5 citation statements)
References 39 publications
“…However, the training process remains lengthy, as DFNet still needs to train the NeRF, pose regression, and feature extraction networks separately. Maggio et al. proposed a Monte Carlo localization method called Loc-NeRF (Maggio et al. 2022), in which Loc-NeRF continuously samples candidate poses around the initial pose and uses NeRF to render novel views to find the correct pose direction. However, Loc-NeRF is unstable and still requires an initial camera pose.…”
Section: Related Work (mentioning)
confidence: 99%
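The candidate-pose sampling this statement describes can be pictured as perturbing an initial pose guess in translation and rotation, then scoring each candidate with a NeRF rendering. The sketch below is only an illustration of that sampling step under assumed Gaussian noise scales; it is not Loc-NeRF's code.

```python
import numpy as np
from scipy.spatial.transform import Rotation as R

def sample_candidate_poses(init_position, init_rotation, n=64,
                           trans_sigma=0.1, rot_sigma_deg=5.0):
    """Draw candidate camera poses around an initial guess.

    init_position : (3,) translation of the initial pose
    init_rotation : scipy Rotation giving the initial orientation
    Returns lists of perturbed positions and rotations.
    """
    positions, rotations = [], []
    for _ in range(n):
        # Gaussian perturbation of the translation.
        positions.append(init_position + np.random.normal(scale=trans_sigma, size=3))
        # Small random rotation composed with the initial orientation.
        axis_angle = np.random.normal(scale=np.deg2rad(rot_sigma_deg), size=3)
        rotations.append(R.from_rotvec(axis_angle) * init_rotation)
    return positions, rotations
```

Each candidate pose would then be rendered through the NeRF and compared against the camera image to decide which direction to move the estimate.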
“…Moreover, from the adaptation module, we can also learn a score for each dense feature for better feature matching and initial localization. Second, rendering-based optimization may easily become stuck in a local minimum (Maggio et al. 2022) and, because it backpropagates through the networks, is also time-consuming. To improve neural rendering-based optimization with a point-based representation, we further propose a novel, efficient rendering-based optimization framework that aligns the rendered image with the query image and minimizes a warping loss function.…”
Section: Introduction (mentioning)
confidence: 99%
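The rendering-based optimization mentioned here typically means gradient descent on the camera pose through a differentiable renderer, driven by a photometric (or warping) loss. A minimal sketch follows, assuming a hypothetical differentiable `render_fn`; it also makes the local-minimum concern concrete, since the loss is highly non-convex in the pose.

```python
import torch

def refine_pose(init_pose, query_image, render_fn, steps=100, lr=1e-2):
    """Refine a 6-DoF pose by gradient descent on a photometric loss.

    init_pose : (6,) tensor (translation + axis-angle rotation)
    render_fn : differentiable function mapping a pose tensor to an RGB image
                (hypothetical; stands in for rendering through a NeRF)
    """
    pose = init_pose.clone().requires_grad_(True)
    optimizer = torch.optim.Adam([pose], lr=lr)
    for _ in range(steps):
        optimizer.zero_grad()
        rendered = render_fn(pose)
        # Photometric loss between the rendered and query images.
        loss = torch.mean((rendered - query_image) ** 2)
        loss.backward()   # backpropagates through the rendering network
        optimizer.step()
    return pose.detach()
```

Because every step requires a full differentiable rendering pass, this kind of refinement is slow and converges only from a good initialization, which is exactly the limitation the citing authors point out.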
“…NeRF predicts the RGB color and density of a point in a scene so that an image from an arbitrary viewpoint can be rendered. This property enables pose estimation [1,30,31,44] based on the photometric loss between the observed image and the rendered image, or the manipulation of tricky objects [5,12,14,25,29]. A pretrained NeRF can also work as a virtual simulator, in which a robot can plan its trajectory [1] and which can be used to train an action policy for the real world [6].…”
Section: Related Work (mentioning)
confidence: 99%
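The statement above summarizes the core NeRF mechanism: per-point color and density are composited along each camera ray to form a pixel. The snippet below is a sketch of the standard discrete volume-rendering sum, C(r) = Σ_i T_i (1 − exp(−σ_i δ_i)) c_i, written for a single ray; variable names are illustrative.

```python
import numpy as np

def composite_ray(colors, densities, deltas):
    """Discrete NeRF-style volume rendering along one ray.

    colors    : (S, 3) RGB predicted at S samples along the ray
    densities : (S,)   volume density sigma at each sample
    deltas    : (S,)   distance between consecutive samples
    Returns the rendered RGB color of the pixel.
    """
    alphas = 1.0 - np.exp(-densities * deltas)                       # per-sample opacity
    trans = np.cumprod(np.concatenate(([1.0], 1.0 - alphas[:-1])))   # accumulated transmittance
    weights = trans * alphas
    return (weights[:, None] * colors).sum(axis=0)
```

Rendering a full image just repeats this compositing for every pixel's ray, which is what makes photometric-loss pose estimation against a NeRF possible.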
“…There have been a number of studies that leverage neural radiance fields (NeRF) for applications other than novel view synthesis. Robotic applications of NeRF are also now beginning to emerge [1,15,31,44,55]. However, many of these approaches require a neural radiance field pretrained on a specific scene and are not generalizable to other scenes.…”
Section: Introduction (mentioning)
confidence: 99%
“…Scene coordinate regression directly predicts the absolute 3D coordinates of image pixels and uses the scene structure explicitly to improve accuracy. More recently, many efforts [28][38][22][17] have been made to use implicit neural representations to replace explicit 3D models in the localization pipeline. Unlike commonly used discrete 3D models such as point clouds and voxel grids, NeRF is an implicit 3D representation inferred from a sparse set of posed images, which models geometry and visual information in continuous 3D space.…”
Section: Introduction (mentioning)
confidence: 99%
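To make the notion of an "implicit 3D representation" concrete, the sketch below shows a tiny network that maps a 3D position and viewing direction to color and density, i.e. a continuous field that can be queried anywhere in space. It is a simplified stand-in for the NeRF architecture (no positional encoding, far fewer layers), not any cited paper's model.

```python
import torch
import torch.nn as nn

class TinyNeRF(nn.Module):
    """Minimal implicit scene representation in the spirit of NeRF.

    Maps a 3D position and a viewing direction to RGB color and density.
    """
    def __init__(self, hidden=128):
        super().__init__()
        self.trunk = nn.Sequential(
            nn.Linear(3, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
        )
        self.density_head = nn.Linear(hidden, 1)
        self.color_head = nn.Sequential(
            nn.Linear(hidden + 3, hidden // 2), nn.ReLU(),
            nn.Linear(hidden // 2, 3), nn.Sigmoid(),
        )

    def forward(self, xyz, view_dir):
        feat = self.trunk(xyz)
        sigma = torch.relu(self.density_head(feat))   # non-negative density
        rgb = self.color_head(torch.cat([feat, view_dir], dim=-1))
        return rgb, sigma
```

Because the scene is stored in the network weights rather than in points or voxels, the representation is continuous and can be rendered from arbitrary viewpoints, which is the property the localization pipelines above exploit.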