2018
DOI: 10.1002/cpe.4487
|View full text |Cite
|
Sign up to set email alerts
|

Temporal sparse feature auto‐combination deep network for video action recognition

Abstract: Summary In order to deal with action recognition for large‐scale video data, we present a spatio‐temporal auto‐combination deep network, which is able to extract deep features from short video segments by making full use of temporal contextual correlation of corresponding pixels among successive video frames. Based on conventional sparse encoding, we further consider the representative features in adjacent nodes of the hidden layers according to activation states similarities. A sparse auto‐combination strateg… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
3
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 30 publications
0
3
0
Order By: Relevance
“…In the following subsection, to show the generality of the TVS-AR method, we describe and evaluate the proposed CNN-based model using the T V S F seq for AR. We present the classification results to prove the performance and suitability of the presented approach using low-resolution T V S F seq in terms of accuracy [65]. We used frame-based approach for recognizing 16 different activities showing the efficacy of a model by demonstrating it for a high HAR accuracy score of approximately 90.99%.…”
Section: Multi-occupant Activity Recognitionmentioning
confidence: 91%
“…In the following subsection, to show the generality of the TVS-AR method, we describe and evaluate the proposed CNN-based model using the T V S F seq for AR. We present the classification results to prove the performance and suitability of the presented approach using low-resolution T V S F seq in terms of accuracy [65]. We used frame-based approach for recognizing 16 different activities showing the efficacy of a model by demonstrating it for a high HAR accuracy score of approximately 90.99%.…”
Section: Multi-occupant Activity Recognitionmentioning
confidence: 91%
“…Action recognition, as an important research area of IoT, has a wide application prospect in daily scenes such as automatic driving, video surveillance, etc. Much works have been recently devoted to action recognition, and among them, local spatiotemporal features are shown to be successful on a variety of challenging action recognition datasets 1–3 …”
Section: Introductionmentioning
confidence: 99%
“…CNNs have also been widely applied in video content analysis. Wang et al apply CNN networks for automatic recognition of human actions in surveillance videos.…”
mentioning
confidence: 99%