Our system is currently under heavy load due to increased usage. We're actively working on upgrades to improve performance. Thank you for your patience.
2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2018
DOI: 10.1109/cvprw.2018.00061
|View full text |Cite
|
Sign up to set email alerts
|

Global Pose Estimation with an Attention-Based Recurrent Network

Abstract: The ability for an agent to localize itself within an environment is crucial for many real-world applications. For unknown environments, Simultaneous Localization and Mapping (SLAM) enables incremental and concurrent building of and localizing within a map. We present a new, differentiable architecture, Neural Graph Optimizer, progressing towards a complete neural network solution for SLAM by designing a system composed of a local pose estimation model, a novel pose selection module, and a novel graph optimiza… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
43
0
1

Year Published

2018
2018
2023
2023

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 79 publications
(48 citation statements)
references
References 44 publications
0
43
0
1
Order By: Relevance
“…We harness CNNs to encode images into high-level features. Optical flow has been proved useful for estimating frame-to-frame ego-motion by lots of current works [22,[31][32][33]38]. We design the encoder based on the Flownet [6] which predicts optical flow between two images.…”
Section: Encodermentioning
confidence: 99%
See 1 more Smart Citation
“…We harness CNNs to encode images into high-level features. Optical flow has been proved useful for estimating frame-to-frame ego-motion by lots of current works [22,[31][32][33]38]. We design the encoder based on the Flownet [6] which predicts optical flow between two images.…”
Section: Encodermentioning
confidence: 99%
“…The Refining module ameliorates previous outputs by employing a spatialtemporal feature reorganization mechanism. 22,[31][32][33]. Due to the high dimensionality of depth maps, the number of frames is commonly limited to no more than 5.…”
Section: Introductionmentioning
confidence: 99%
“…Zhu et al (2017) also peformed visual navigation via deep reinforcement learning but explicitly left out mapping the environment. Parisotto et al (2018) used a neural architecture for localisation; a graph of observations can be seen as a map, which is then used to iteratively refine a pose trajectory.…”
Section: Related Workmentioning
confidence: 99%
“…Neural solutions for the SLAM problem are also being developed (e.g. [7]), but their architectures are far from purely discriminative DCNNs.…”
Section: Frameworkmentioning
confidence: 99%