2022
DOI: 10.3390/s22082936
|View full text |Cite
|
Sign up to set email alerts
|

Self-Supervised Object Distance Estimation Using a Monocular Camera

Abstract: Distance estimation using a monocular camera is one of the most classic tasks for computer vision. Current monocular distance estimating methods need a lot of data collection or they produce imprecise results. In this paper, we propose a network for both object detection and distance estimation. A network-based on ShuffleNet and YOLO is used to detect an object, and a self-supervised learning network is used to estimate distance. We calibrated the camera, and the calibrated parameters were integrated into the … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 16 publications
(6 citation statements)
references
References 62 publications
0
3
0
Order By: Relevance
“…To alleviate inaccurate range estimations in different views, regression based on deep features can come in handy [62,[81][82][83][84][85]. Alike the FCN methods, the typical approach of deep regressors consists of two steps for object detection and distance regression.…”
Section: Range Estimation Approachesmentioning
confidence: 99%
See 1 more Smart Citation
“…To alleviate inaccurate range estimations in different views, regression based on deep features can come in handy [62,[81][82][83][84][85]. Alike the FCN methods, the typical approach of deep regressors consists of two steps for object detection and distance regression.…”
Section: Range Estimation Approachesmentioning
confidence: 99%
“…More recently, refs. [81,83] replaced R-CNN with YOLO and added a distance regression module exploiting extracted deep features. While the previous methods were limited to a specific set of object classes that are detectable by object detectors, ref.…”
Section: Range Estimation Approachesmentioning
confidence: 99%
“…For instance, [1] demonstrated a smartphone-based system for object detection, which, while effective, necessitates handheld operation and lacks distance estimation capabilities. Similarly, [2] explored the use of ultrasonic sensors attached to canes, offering some distance information but at the expense of user convenience and range of detection. Contrasting these approaches, our project innovates by integrating object detection and distance estimation into a single, wearable device, combining the convenience of a hands-free experience with the accuracy and immediacy of real-time feedback.…”
Section: B Project Statementmentioning
confidence: 99%
“…Compared to conventional stereo-based methods, self-supervised depth estimators provide widespread availability of training sequences of monocular video, but they need to simultaneously calculate depth and ego-motion by minimizing the photometric reprojection loss [51,56]. For the purposes of resolving the unknown scale factor through sequence input, a self-supervised learning network with a new reconstruction loss function was used with an average inference time of 12.5 ms per image [57]. As a faster state-of-the-art model, the Monodepth2 has acceptable results in the depth estimation used in an autonomous vehicle or smart navigation [58], which was enhanced from ResNet18.…”
Section: Introductionmentioning
confidence: 99%