2019
DOI: 10.1109/tpami.2017.2782743
|View full text |Cite
|
Sign up to set email alerts
|

Panoptic Studio: A Massively Multiview System for Social Interaction Capture

Abstract: We present an approach to capture the 3D motion of a group of people engaged in a social interaction, where inter-occlusions are frequent and functional. The Panoptic Studio is a system organized around the thesis that social interactions should be measured through the integration of perceptual analyses over a large variety of viewpoints. We present a modularized system designed around this principle, consisting of integrated structural, hardware, and software innovations. The system takes, as input, 480 synch… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
246
0
1

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 259 publications
(252 citation statements)
references
References 50 publications
1
246
0
1
Order By: Relevance
“…We generate the ground truth depth maps from the point cloud with the screened Poisson surface reconstruction method [15]. We choose scenes: 1,4,9,10,11,12,13,15,23,24,29,32,33,34,48,49,62,75,77,110,114,118 as the testing set and the other scenes as training set. The RGBD, SUN3D, MVS and Scenes11 datasets contain more than 30000 different scenes in total, which are very different from the DTU dataset.…”
Section: Implementation Detailsmentioning
confidence: 99%
“…We generate the ground truth depth maps from the point cloud with the screened Poisson surface reconstruction method [15]. We choose scenes: 1,4,9,10,11,12,13,15,23,24,29,32,33,34,48,49,62,75,77,110,114,118 as the testing set and the other scenes as training set. The RGBD, SUN3D, MVS and Scenes11 datasets contain more than 30000 different scenes in total, which are very different from the DTU dataset.…”
Section: Implementation Detailsmentioning
confidence: 99%
“…We project the 3D human poses of different HOIs into 2D poses with random camera poses. (ii) The dataset proposed and collected by [19], which also contains 3D poses of multiple persons in social interactions. We project 3D poses into 2D following the same method as in (i).…”
Section: Concurrent Action Detectionmentioning
confidence: 99%
“…The vector ↔ L n,: X * in Eq. (20), lies on a local motion plane formed by X * n,: and it's two neighboring points. Similarly,each row in LX * will also be a vector on a local motion plane.…”
Section: Structure Reconstruction Accuracymentioning
confidence: 99%
“…Average running time for minimizing either X or W are smaller due to the sparsity of W. Total number of iterations depends on initialization quality, reported experiments ran an average of 62.26 iterations. Results on Dancing and Toddler[20]. Disjoint Dancing segments form an input datum.…”
mentioning
confidence: 99%