2021
DOI: 10.48550/arxiv.2103.01100
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Categorical Depth Distribution Network for Monocular 3D Object Detection

Abstract: Monocular 3D object detection is a key problem for autonomous vehicles, as it provides a solution with simple configuration compared to typical multi-sensor systems. The main challenge in monocular 3D detection lies in accurately predicting object depth, which must be inferred from object and scene cues due to the lack of direct range measurement. Many methods attempt to directly estimate depth to assist in 3D detection, but show limited performance as a result of depth inaccuracy. Our proposed solution, Categ… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
5
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 43 publications
(6 citation statements)
references
References 54 publications
1
5
0
Order By: Relevance
“…Furthermore, our method also outperforms the methods with extra data. For instance, compared with the recently proposed CaDNN [38] utilizing LiDAR signals as supervision of depth estimation sub-task, our method still obtains 0.94%, 0.79%, and 0.31% gains on the three difficulty settings, which confirms the effectiveness of the proposed method.…”
Section: Resultssupporting
confidence: 66%
“…Furthermore, our method also outperforms the methods with extra data. For instance, compared with the recently proposed CaDNN [38] utilizing LiDAR signals as supervision of depth estimation sub-task, our method still obtains 0.94%, 0.79%, and 0.31% gains on the three difficulty settings, which confirms the effectiveness of the proposed method.…”
Section: Resultssupporting
confidence: 66%
“…A quantitative comparison between 2D and 3D detectors is included in Section 5.2. methods. On state-of-the-art 3D detection benchmarks [2,12], state-of-the-art monocular methods [26,41] achieve about half the mAP detection accuracy, of standard Lidar based baselines [60]. Pseudo-Lidar [56] based methods produce a virtual point cloud from RGB images, similar to our approach.…”
Section: Related Workmentioning
confidence: 73%
“…We also achieve state-of-the-art performance on WOD [38] as Table 3 shows. We surpass the previous state-of-the-art methods such as CaDDN [31] and PCT [41]. We also re-implement MonoFlex [51] on WOD [38] for a fair comparison.…”
Section: Comparison With State-of-the-art Methodsmentioning
confidence: 99%
“…Italics: These methods utilize the whole train set, while the others use 1/3 amount of images in train set. ‡: M3D-RPN is reimplemented by [31]. †: PatchNet is re-implemented by [41].…”
Section: Ablation Studiesmentioning
confidence: 99%