2021
DOI: 10.1109/tpami.2019.2929257
|View full text |Cite
|
Sign up to set email alerts
|

OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields

Abstract: Realtime multi-person 2D pose estimation is a key component in enabling machines to have an understanding of people in images and videos. In this work, we present a realtime approach to detect the 2D pose of multiple people in an image. The proposed method uses a nonparametric representation, which we refer to as Part Affinity Fields (PAFs), to learn to associate body parts with individuals in the image. This bottom-up system achieves high accuracy and realtime performance, regardless of the number of people i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

2
2,155
0
29

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
3

Relationship

0
6

Authors

Journals

citations
Cited by 3,268 publications
(2,449 citation statements)
references
References 69 publications
2
2,155
0
29
Order By: Relevance
“…The 2D projections are thereafter grouped into mutually exclusive clusters, and for each cluster, we allocate a representative pose. Then, in the second stage, for an input frame taken from a video stream, we extract in real time its 2D pose using the OpenPose network . The 2D pose is then rescaled so as to be consistent with the 2D projections stored in the database.…”
Section: Methods Overviewmentioning
confidence: 99%
See 4 more Smart Citations
“…The 2D projections are thereafter grouped into mutually exclusive clusters, and for each cluster, we allocate a representative pose. Then, in the second stage, for an input frame taken from a video stream, we extract in real time its 2D pose using the OpenPose network . The 2D pose is then rescaled so as to be consistent with the 2D projections stored in the database.…”
Section: Methods Overviewmentioning
confidence: 99%
“…The human poses in these CMU 3D data are represented by n =30 joint positions and rotations. In our work, we use 2D poses that are estimated from an input monocular video using OpenPose (see Section 5 for more details) that are represented by m =14 joint locations (see Figure ). Thus, in order to have a uniform and comparable skeleton, we retarget the CMU .bvh data to a 3D skeleton that its projection in T‐pose matches the 2D skeleton (in T‐pose) returned by OpenPose (see Figure for the new skeleton); the 2D pose projections are then scaled so as their bounding box remains constant over time.…”
Section: Motion Databasementioning
confidence: 99%
See 3 more Smart Citations