2021
DOI: 10.3390/s21123945
|View full text |Cite
|
Sign up to set email alerts
|

HUMANNET—A Two-Tiered Deep Neural Network Architecture for Self-Occluding Humanoid Pose Reconstruction

Abstract: Majority of current research focuses on a single static object reconstruction from a given pointcloud. However, the existing approaches are not applicable to real world applications such as dynamic and morphing scene reconstruction. To solve this, we propose a novel two-tiered deep neural network architecture, which is capable of reconstructing self-obstructed human-like morphing shapes from a depth frame in conjunction with cameras intrinsic parameters. The tests were performed using on custom dataset generat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
6
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
7

Relationship

2
5

Authors

Journals

citations
Cited by 10 publications
(6 citation statements)
references
References 56 publications
(53 reference statements)
0
6
0
Order By: Relevance
“…For each region of interest (ROI), a multi-objective loss function consisting of classification losses (1,2), localization losses (3,4), and mask segmentation ( 5) is used.…”
Section: Loss Functionmentioning
confidence: 99%
See 1 more Smart Citation
“…For each region of interest (ROI), a multi-objective loss function consisting of classification losses (1,2), localization losses (3,4), and mask segmentation ( 5) is used.…”
Section: Loss Functionmentioning
confidence: 99%
“…Kulikajev et al carried out a series of studies [2][3][4] involving 3D reconstruction of the entire human body. These studies were unique due to their consideration of data as objects from an imperfect real-world frame; that is, the original data for reconstruction may include noise, glare, highlights, and low photo quality.…”
Section: Introductionmentioning
confidence: 99%
“…Recognition and detection of human poses are very widely used in neural networks, as they have excellent accuracy and effectiveness on larger datasets [34,35]. However, there is a limitation in the DNN model since the minute intersections or joints of a canine feature detection are very confused to detect the pose.…”
Section: Feature Extractionmentioning
confidence: 99%
“…FPS uses Euclidean distance metric to iteratively search for the sampling points, and the selected point is that farthest from other unselected members in each iteration [16]. Kulikajevas et al [8] proposed a two-tiered deep neural network for self-occluding humanoid pose reconstruction, in which the clipping network is designed to clip the region of interest and down sampling it with FPS for the subsequent reconstruction network. Since FPS can well cover the whole set of points, several methods use it to extract the feature point [35,38].…”
Section: Introductionmentioning
confidence: 99%