2020
DOI: 10.1109/access.2020.3010307
|View full text |Cite
|
Sign up to set email alerts
|

Simple Baseline for Vehicle Pose Estimation: Experimental Validation

Abstract: Significant progress on human and vehicle pose estimation has been achieved in recent years. The performance of these methods has evolved from poor to remarkable in just a couple of years. This improvement has been obtained from increasingly complex architectures. In this paper, we explore the applicability of simple baseline methods by adding a few deconvolutional layers on a backbone network to estimate heat maps that correspond to the vehicle keypoints. This approach has been proven to be very effective for… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
1
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
3

Relationship

2
4

Authors

Journals

citations
Cited by 7 publications
(2 citation statements)
references
References 48 publications
(78 reference statements)
0
1
0
Order By: Relevance
“…However, Stacked Hourglass has some limitations, such as incomplete retention of high‐resolution information. To overcome this issue and improve accuracy, a study [34] used the SimpleBaseline network. VehiPose [35] employs the Waterfall Atrous Spatial Pooling architecture (WASP) [36] in semantic segmentation, enabling more spatially detailed features and efficient multi‐scale feature extraction.…”
Section: Related Workmentioning
confidence: 99%
“…However, Stacked Hourglass has some limitations, such as incomplete retention of high‐resolution information. To overcome this issue and improve accuracy, a study [34] used the SimpleBaseline network. VehiPose [35] employs the Waterfall Atrous Spatial Pooling architecture (WASP) [36] in semantic segmentation, enabling more spatially detailed features and efficient multi‐scale feature extraction.…”
Section: Related Workmentioning
confidence: 99%
“…Occlusion-net [34] uses a 3D graph network with self-supervision to predict 2D and 3D keypoints of vehicles using the CarFusion dataset [45], while GSNet [46] predicts 6DoF car pose and reconstructs dense 3D shape simultaneously. Without 3D information, the popular OpenPose [47] shows qualitative results for vehicles and Simple Baseline [48] extends a top-down pose estimator for cars on a custom dataset based on Pascal3D+ [49].…”
Section: Beyond Humansmentioning
confidence: 99%