2021
DOI: 10.48550/arxiv.2112.08635
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Road-aware Monocular Structure from Motion and Homography Estimation

Abstract: Structure from motion (SFM) and ground plane homography estimation are critical to autonomous driving and other robotics applications. Recently, much progress has been made in using deep neural networks for SFM and homography estimation respectively. However, directly applying existing methods for ground plane homography estimation may fail because the road is often a small part of the scene. Besides, the performances of deep SFM approaches are still inferior to traditional methods. In this paper, we propose a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 33 publications
0
2
0
Order By: Relevance
“…Recently, some Convolutional Neural Networks (CNN) have also been proposed to estimate ground planes. Particularly, given a monocular image sequence, photometric consistency can be used with homography warping to recover the normal vector in a self-supervised manner [ 25 , 26 ]. To further improve the accuracy, GroundNet [ 27 ] jointly learns pixel-level normals, ground segmentation, and depth maps using multiple networks.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Recently, some Convolutional Neural Networks (CNN) have also been proposed to estimate ground planes. Particularly, given a monocular image sequence, photometric consistency can be used with homography warping to recover the normal vector in a self-supervised manner [ 25 , 26 ]. To further improve the accuracy, GroundNet [ 27 ] jointly learns pixel-level normals, ground segmentation, and depth maps using multiple networks.…”
Section: Related Workmentioning
confidence: 99%
“…Traditional methods usually first estimate homography transform, then decompose it into ground plane normal and ego-motion [ 23 , 24 ]. Recently, some neural networks were proposed to estimate the depth and normal simultaneously at the pixel level, with photometric and geometric consistency [ 25 , 26 , 27 ]. However, these image-based methods suffer from inadequate accuracy due to a loose connection between the ground plane normal dynamics and image clues.…”
Section: Introductionmentioning
confidence: 99%