2021
DOI: 10.48550/arxiv.2112.09205
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

AFDetV2: Rethinking the Necessity of the Second Stage for Object Detection from Point Clouds

Abstract: There have been two streams in the 3D detection from point clouds: single-stage methods and two-stage methods. While the former is more computationally efficient, the latter usually provides better detection accuracy. By carefully examining the two-stage approaches, we have found that if appropriately designed, the first stage can produce accurate box regression. In this scenario, the second stage mainly rescores the boxes such that the boxes with better localization get selected. From this observation, we hav… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 30 publications
0
4
0
Order By: Relevance
“…According to the test results in Table 2, DeepFusion achieves the best results on Waymo Challenge Leaderboard demonstrating the effectiveness of our approach. For example, DeepFusion-Ens achieves the best results on the Waymo Challenge Leaderboard; DeepFusion improves 2.42 APH/L2 compared with previous state-of-theart single-model method, AFDetV2 [12].…”
Section: State-of-the-art Performance On Waymo Datamentioning
confidence: 95%
See 1 more Smart Citation
“…According to the test results in Table 2, DeepFusion achieves the best results on Waymo Challenge Leaderboard demonstrating the effectiveness of our approach. For example, DeepFusion-Ens achieves the best results on the Waymo Challenge Leaderboard; DeepFusion improves 2.42 APH/L2 compared with previous state-of-theart single-model method, AFDetV2 [12].…”
Section: State-of-the-art Performance On Waymo Datamentioning
confidence: 95%
“…Our very best model concatenates 5 frames, and with dropframe probabilities 0.5 during training. Besides, we also apply model ensemble and Test-Time Augmentation (TTA) by weighted box fusion (WBF) [12]. For TTA, we use yaw rotation, and global scaling.…”
Section: A2 Implementation Details Of 3d Detectorsmentioning
confidence: 99%
“…Based on this paradigm, many studies [15], [16], [17] improve the performance from different perspectives. Another hot trend aims to enhance the sparse backbone by transformer architecture [18], such as [4], [19], [20], [21], [22], [23], [24].…”
Section: Semi-dense Detectorsmentioning
confidence: 99%
“…For both models, we use Stochastic Weights Averaging (SWA) [12,13] to further enhance the training. We train each model for one additional epoch using decreased learning rate and greedily average the weights.…”
Section: Model Variations and Ensemblingmentioning
confidence: 99%