2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2021
DOI: 10.1109/cvprw53098.2021.00275
|View full text |Cite
|
Sign up to set email alerts
|

Real-time Monocular Depth Estimation with Sparse Supervision on Mobile

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 15 publications
(6 citation statements)
references
References 39 publications
0
4
0
Order By: Relevance
“…FastDepth achieves 178 fps on the 8GB NVIDIA Jetson TX2 device. Recently, Yucel et al [34] propose a small network composed by the MobileNet v2 [33] as encoder and FBNet x112 [35] as decoder, trained on an altered knowledge distillation process; the model achieves 37 fps on smartphone GPU. Papa et al [7] design SPEED, a separable pyramidal pooling architecture characterized by an improved version of the MobileNet v1 [36] as an encoder and a dedicated decoder.…”
Section: Lightweight Mde Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…FastDepth achieves 178 fps on the 8GB NVIDIA Jetson TX2 device. Recently, Yucel et al [34] propose a small network composed by the MobileNet v2 [33] as encoder and FBNet x112 [35] as decoder, trained on an altered knowledge distillation process; the model achieves 37 fps on smartphone GPU. Papa et al [7] design SPEED, a separable pyramidal pooling architecture characterized by an improved version of the MobileNet v1 [36] as an encoder and a dedicated decoder.…”
Section: Lightweight Mde Methodsmentioning
confidence: 99%
“…In this section, METER is compared with state of the art lightweight models as [7], [8], [31], [32], [34], which are designed to infer at high speed on embedded devices while keeping a small memory footprint (lower than 3GB). This choice is due to the limited amount of available memory in the chosen platforms.…”
Section: A Comparison With State Of the Art Methodsmentioning
confidence: 99%
“…Real-time depth estimators are often motivated by other applications like robotics [25], automated driving [64], or low-resolution estimation for augmented reality in mobile devices [63], therefore, do not yet provide sufficient performance for immersive video. Yet, deep-learning based monocular depth estimation is also progressing quickly [62], [65], indicating the possibility of using them in near future also for immersive video applications.…”
Section: B Depth Estimation For Immersive Videomentioning
confidence: 99%
“…One simple alternative is employing lightweight architectures such as MobileNet [24,25,49,55], GhostNet [21], and FBNet [54]. One popular approach is utilizing network compression techniques, including quantization [22], network pruning [58], and knowledge distillation [60]. Other methods employ well-known pyramid networks or dynamic optimization schemes [1].…”
Section: Related Workmentioning
confidence: 99%