2021
DOI: 10.1109/tnnls.2020.3044181

Masked GAN for Unsupervised Depth and Pose Prediction With Scale Consistency

Citations: cited by 43 publications (17 citation statements)
References: 35 publications
“…The existing methods which are trained with monocular video sequences simultaneously predict the scene depths and estimate the camera poses [1, 3, 5, 13, 17, 18, 21, 24, 31, 34, 40-42, 44]. Zhou et al. [44] proposed an end-to-end approach comprised of two separate networks for predicting depths and camera poses.…”
Section: Self-supervised Monocular Training (mentioning)
confidence: 99%
“…We firstly evaluate the OCFD-Net with/without a postprocessing step (PP.) [12] on the raw KITTI Eigen test set [7] in comparison to 20 state-of-the-art methods, including 10 methods trained with monocular video sequences (M) [1, 17, 18, 21, 24, 31, 34, 42-44] and 10 methods trained with stereo image pairs (S) [12, 13, 15, 26-28, 32, 36, 37, 45]. As done in [15, 17], we also evaluate the OCFD-Net on the improved KITTI Eigen test set [33].…”
Section: Comparative Evaluation (mentioning)
confidence: 99%
“…Image and texture synthesis are challenging tasks [28], [29]. With the breakthrough of GANs [16], [30], [31], [32], [33], [34], [35], directly generating a handwritten text image has become an interesting topic. Non-recurrent generative methods [1], [9], [13], [14] can produce a handwritten text image according to a given text string.…”
Section: A Handwritten Text Image Synthesis (mentioning)
confidence: 99%
“…Occlusions and moving objects affect the pixel correspondence between images, thus impacting the photometric loss during training and resulting in the limited performance of the depth network. A number of methods [12], [27]-[29] design a mask or mask network to estimate the regions that violate the projection, so as to reduce the effect of these regions on the training process. Since the mask network is jointly trained with pose and depth networks in an unsupervised manner, this method cannot completely address the influence of occlusions and moving objects.…”
Section: Related Work (mentioning)
confidence: 99%