2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2018
DOI: 10.1109/cvprw.2018.00056
|View full text |Cite
|
Sign up to set email alerts
|

Pose Encoding for Robust Skeleton-Based Action Recognition

Abstract: Some of the main challenges in skeleton-based action recognition systems are redundant and noisy pose transformations. Earlier works in skeleton-based action recognition explored different approaches for filtering linear noise transformations, but neglect to address potential nonlinear transformations. In this paper, we present an unsupervised learning approach for estimating nonlinear noise transformations in pose estimates. Our approach starts by decoupling linear and nonlinear noise transformations. While t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
28
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
5
2
1

Relationship

3
5

Authors

Journals

citations
Cited by 35 publications
(29 citation statements)
references
References 19 publications
1
28
0
Order By: Relevance
“…As future work, we intend to explore in more detail the properties of the deformation-based alignment in motion analysis. For that purpose, a penalty term would be added in the minimization problem, such that it would incorporate problemrelated constraints, e.g., 3D skeleton-specific geometry or noisefree skeleton dynamics [30]. We also intend to expand the proposed approach to represent a skeleton sequence as a point in the deformation space, without any prior knowledge.…”
Section: Discussionmentioning
confidence: 99%
“…As future work, we intend to explore in more detail the properties of the deformation-based alignment in motion analysis. For that purpose, a penalty term would be added in the minimization problem, such that it would incorporate problemrelated constraints, e.g., 3D skeleton-specific geometry or noisefree skeleton dynamics [30]. We also intend to expand the proposed approach to represent a skeleton sequence as a point in the deformation space, without any prior knowledge.…”
Section: Discussionmentioning
confidence: 99%
“…ii The biases of the gating units, g l , in G l , at layer l are initialized to negative values such as -1 or -3 [10] to ensure that most of the features learned at layer l, H(x) l , are untransformed features, H(x) l−1 . This is due to the gating units' activations, G l (H(x) l−1 ), being close to 0; see (1).…”
Section: A Background: Highway Networkmentioning
confidence: 95%
“…D eep neural networks have found applications in many real-life tasks; their successes for learning different difficult problems are well documented. In particular, the field of computer vision has hugely benefited from deep neural networks for various applications ranging from pose estimation [1], segmentation [2], action recognition [3], face recognition [4], etc. In recent times, there is a growing trend of using deep networks for learning directly from raw data (i.e.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…TABLE IV A COMPARISON BETWEEN THE PROPOSED METHOD AND STATE-OF-THE-ART APPROACHES IN TERMS OF NORTH WESTERN UCLA DATASET. Paper Cross-subject Cross-view Virtual view [34] 50.70 47.80 Hankelet [35] 54.20 45.20 MST-AOG [26] 81.60 73.30 Action Bank [36] 24.60 17.60 Poselet [37] 54.90 24.50 Denoised-LSTM [38] -79.57 tLDS [39] 92 It can be seen in Table IV, Virtual view [34] and Hanklet [35] methods are limited in their performance which reflects the challenges of the North Western UCLA dataset (e.g. noise, cluttered backgrounds and various view points).…”
Section: A North Western Ucla Datasetmentioning
confidence: 99%