2019
DOI: 10.1007/s11042-019-7433-7
|View full text |Cite
|
Sign up to set email alerts
|

Deep neural networks for human pose estimation from a very low resolution depth image

Abstract: The work presented in the paper is dedicated to determining and evaluating the most efficient neural network architecture applied as a multiple regression network localizing human body joints in 3D space based on a single low resolution depth image. The main challenge was to deal with a noisy and coarse representation of the human body, as observed by a depth sensor from a large distance, and to achieve high localization precision. The regression network was expected to reason about relations of body parts bas… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
10
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
9

Relationship

0
9

Authors

Journals

citations
Cited by 21 publications
(11 citation statements)
references
References 25 publications
0
10
0
Order By: Relevance
“…By converting it to the frequency domain and using the Fast Fourier Transform (FFT), we sample all 512 points. Since the number of human fingers per palm is always less than six, only the first six frequencies (F 1 -F 6 ) need to be considered and described as in Equations (11)(12)(13)(14):…”
Section: Methodsmentioning
confidence: 99%
“…By converting it to the frequency domain and using the Fast Fourier Transform (FFT), we sample all 512 points. Since the number of human fingers per palm is always less than six, only the first six frequencies (F 1 -F 6 ) need to be considered and described as in Equations (11)(12)(13)(14):…”
Section: Methodsmentioning
confidence: 99%
“…HUMAN4D enables research to human pose-related computer vision tasks by providing spatio-temporally aligned RGBD data from multiple views under a HW-SYNC setting, along with accurate 3D and 2D poses. Recent research efforts are devoted on various single-and multi-person pose estimation approaches, from single RGB in the wild [18], [57]- [59], depth [60], [61], multi-view RGB [23], [62] and multiview RGBD [22], [63], among others. However, the selection criteria of the methods we benchmark are to be open-source and applicable to HUMAN4D, producing baseline results for our dataset.…”
Section: Pose Estimationmentioning
confidence: 99%
“…Chen et al [24] propose to use generative adversarial networks to exploit the constrained human-pose distribution for improving single-person pose estimation. Szczuko proposes to localize single-person body joints in 3D space based on a single low resolution depth image [25].…”
Section: Deep Neural Network For Single Person Pose Estimationmentioning
confidence: 99%
“…In addition, the accuracy of non-deep neural network approaches is lower than the following deep neural network approaches [16,18]. The single person pose estimation works with the guarantee of only one person present in the image [16][17][18][19][20][21][22][23][24][25]35,36]. However, the scenes with multi-person are exceedingly common in our daily life.…”
Section: Information Used For Human Pose Estimationmentioning
confidence: 99%