2020
DOI: 10.1007/s11390-020-0405-6
|View full text |Cite
|
Sign up to set email alerts
|

Two-Stream Temporal Convolutional Networks for Skeleton-Based Human Action Recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
22
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 22 publications
(23 citation statements)
references
References 38 publications
0
22
0
Order By: Relevance
“…To collect 3D skeleton data, RGB deep images are captured by the Microsoft Kinect sensor. This method is one of the most popular to estimate 3D human pose [ 5 , 16 , 18 ]. The method converts 2D image detections from multiple camera views into 3D images [ 28 , 29 , 30 ].…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…To collect 3D skeleton data, RGB deep images are captured by the Microsoft Kinect sensor. This method is one of the most popular to estimate 3D human pose [ 5 , 16 , 18 ]. The method converts 2D image detections from multiple camera views into 3D images [ 28 , 29 , 30 ].…”
Section: Related Workmentioning
confidence: 99%
“…The skeleton data representation of human poses in videos is a popular technique for action recognition [ 5 , 15 , 16 , 17 , 18 , 19 ]. In this technique, the main task is to identify the skeleton data, including the detailed location of joints.…”
Section: Introductionmentioning
confidence: 99%
“…Jia et al [ 9 ] introduced a new variant of feature representation for the skeleton-based human action recognition problem. They divided the usual vector representation for a human skeleton into five relevant joint subgroups, namely the left arm, right arm, left leg, right leg and trunk and then those parts were linked together into a whole body with the head.…”
Section: Related Workmentioning
confidence: 99%
“…Action—actions are single-person activities that may be composed of multiple gestures organized temporally. Most datasets [ 2 , 3 , 4 , 5 , 6 ] and most proposed solutions [ 7 , 8 , 9 , 10 , 11 ] are focused on this category.…”
Section: Introductionmentioning
confidence: 99%
“…Even though skeleton pose estimation is a structured data type, several methods approached the problem with 2D ConvNets [74][75][76][77]. Li et al [77] proposed a two-stream 2D ConvNet: one to extract features from spatial coordinates of the pose in a 3D manner (position, joints and frames) through a skeleton transformer module, which extracts weighted interpolated joints matrix.…”
Section: Global Representationsmentioning
confidence: 99%