2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019
DOI: 10.1109/cvpr.2019.01112
|View full text |Cite
|
Sign up to set email alerts
|

CrowdPose: Efficient Crowded Scenes Pose Estimation and a New Benchmark

Abstract: Multi-person pose estimation is fundamental to many computer vision tasks and has made significant progress in recent years. However, few previous methods explored the problem of pose estimation in crowded scenes while it remains challenging and inevitable in many scenarios. Moreover, current benchmarks cannot provide an appropriate evaluation for such cases. In this paper, we propose a novel and efficient method to tackle the problem of pose estimation in the crowd and a new dataset to better evaluate algorit… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

3
236
0

Year Published

2019
2019
2020
2020

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 431 publications
(259 citation statements)
references
References 43 publications
3
236
0
Order By: Relevance
“…Each of them comes with different advantages and disadvantages. Although bottom-up methods seem to be better suited for crowded scenes since they process whole images and their runtime is thus less dependent of the actual number of persons, literature shows that top-down methods like [6] and [11] perform comparably well on publicly available datasets.…”
Section: Top-down and Bottom-up Methodsmentioning
confidence: 99%
See 3 more Smart Citations
“…Each of them comes with different advantages and disadvantages. Although bottom-up methods seem to be better suited for crowded scenes since they process whole images and their runtime is thus less dependent of the actual number of persons, literature shows that top-down methods like [6] and [11] perform comparably well on publicly available datasets.…”
Section: Top-down and Bottom-up Methodsmentioning
confidence: 99%
“…However, some recent works focus on this subject. Li et al [6] introduce a new benchmark for evaluating pose estimation methods for this problem and proposed a method where human bounding box proposals obtained by human detector are fed into joint-candidate single person pose estimator (JCSPPE). JC-SPPE locates the joint candidates with different response scores on the heatmap.…”
Section: Further Work On Crowd Pose Estimationmentioning
confidence: 99%
See 2 more Smart Citations
“…Hierarchical/Graphical Models in Computer Vision: Hierarchical/graphical models are powerful for building structured representations, which can reflect task-specific relations and constraints. From early distributional semantic models, part-based models [16,17], MRF/CRF [31], And-Or grammar model [59], to deep structural networks [30,15], graph neural networks [20], trainable CRF [79], etc., hierarchical/graphical models have found applications in a wide variety of core computer vision tasks, such as object recognition [55], human parsing [40,41,81], pose estimation [34,66,61,68,35], visual dialog etc., to the extent that they are now ubiquitous in the field. Inspired by their general success, we leverage structural information to design our approach.…”
Section: Related Workmentioning
confidence: 99%