2022
DOI: 10.48550/arxiv.2207.01610
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

PVO: Panoptic Visual Odometry

Abstract: We present a novel panoptic visual odometry framework, termed PVO, to achieve a more comprehensive modeling of the scene's motion, geometry, and panoptic segmentation information. PVO models visual odometry (VO) and video panoptic segmentation (VPS) in a unified view, enabling the two tasks to facilitate each other. Specifically, we introduce a panoptic update module into the VO module, which operates on the image panoptic segmentation. This Panoptic-Enhanced VO module can trim the interference of dynamic obje… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 40 publications
(67 reference statements)
0
1
0
Order By: Relevance
“…Depending on the constructed maps, conventional RGB-D SLAM systems are commonly categorised into sparse SLAM [1], [2], dense SLAM [3]- [5] and hybrid ones [6], [7]. Later, with advances in deep neural networks (DNNs), much attention has been paid to improve various aspects of SLAM with semantic information extracted with DNNs, such as meaningful mapping [8]- [10], dynamic tracking [11]- [13], relocalisation [14], [15], etc. Although these systems have demonstrated promising performance in terms of both tracking and reconstruction accuracy, they suffer from the huge video memory (VRAM) footprint in storing the reconstructed map and the computational consumption when modifying the map on-the-fly.…”
Section: Introductionmentioning
confidence: 99%
“…Depending on the constructed maps, conventional RGB-D SLAM systems are commonly categorised into sparse SLAM [1], [2], dense SLAM [3]- [5] and hybrid ones [6], [7]. Later, with advances in deep neural networks (DNNs), much attention has been paid to improve various aspects of SLAM with semantic information extracted with DNNs, such as meaningful mapping [8]- [10], dynamic tracking [11]- [13], relocalisation [14], [15], etc. Although these systems have demonstrated promising performance in terms of both tracking and reconstruction accuracy, they suffer from the huge video memory (VRAM) footprint in storing the reconstructed map and the computational consumption when modifying the map on-the-fly.…”
Section: Introductionmentioning
confidence: 99%