2022
DOI: 10.3390/s22072576
|View full text |Cite
|
Sign up to set email alerts
|

Deep Learning-Based Monocular 3D Object Detection with Refinement of Depth Information

Abstract: Recently, the research on monocular 3D target detection based on pseudo-LiDAR data has made some progress. In contrast to LiDAR-based algorithms, the robustness of pseudo-LiDAR methods is still inferior. After conducting in-depth experiments, we realized that the main limitations are due to the inaccuracy of the target position and the uncertainty in the depth distribution of the foreground target. These two problems arise from the inaccurate depth estimation. To deal with the aforementioned problems, we propo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 11 publications
(12 citation statements)
references
References 50 publications
0
11
0
Order By: Relevance
“…In order to prove the performance of the proposed method, it is compared with the methods in reference [26] and reference [27] under the same experimental conditions. Te comparison results are shown in Table 2.…”
Section: Performance Comparison With Other Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…In order to prove the performance of the proposed method, it is compared with the methods in reference [26] and reference [27] under the same experimental conditions. Te comparison results are shown in Table 2.…”
Section: Performance Comparison With Other Methodsmentioning
confidence: 99%
“…Reference [26] introduces deformable convolution into the point cloud target detection network to enhance the adaptability of the network to detection targets in diferent directions and shapes. Reference [27] proposed a target detection method based on improved GUPNET. Te experimental results show that the average accuracy of the proposed method is the highest, with an average accuracy of 0.919 in simple difcult scenes, 0.897 in medium difcult scenes, and 0.839 in difcult scenes; reference [27] has an average accuracy of 0.900 in simple difcult scenes, 0.856 in medium difcult scenes, and 0.802 in difcult scenes; the average accuracy of reference [26] is the lowest, with the average accuracy of only 0.896 in simple difcult scenes, 0.879 in medium difcult scenes, and 0.783 in difcult scenes.…”
Section: Performance Comparison With Other Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…On the KITTI 3D object detection benchmark, our model ranks first in all difficulty levels among all of the previously monocular models by achieving an AP moderate score of 20.95%. Our model outperforms the previously best 3D intermidiate representation method [7] by a large margin of 3.15%. On the KITTI bird's eye view benchmark, our model achieves an AP moderate score of 28.50%, outperforming all of the previously published models.…”
Section: Benchmark Evaluationmentioning
confidence: 73%
“…However, the point clouds generated from photogrammetry are limited with poor precision which affects the object detection accuracy [2]. To overcome the issue of point clouds, LiDAR (Light Detection and Ranging) sensor is introduced which directly acquires the point clouds from the object and shows promising results in terms of accuracy, and provides faster results [3]. However, LiDARis susceptible to environmental noise, sparsity, and lack of depth information.…”
Section: Index Terms -3d Object Detectionnoise Removalsemantic Segmen...mentioning
confidence: 99%