Temporal sparse feature auto‐combination deep network for video action recognition

Wang, Qicong; Gong, Dingxi; Qi, Man; Shen, Yehu; Lei, Yunqi

doi:10.1002/cpe.4487

Cited by 5 publications

(3 citation statements)

References 30 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In the following subsection, to show the generality of the TVS-AR method, we describe and evaluate the proposed CNN-based model using the T V S F seq for AR. We present the classification results to prove the performance and suitability of the presented approach using low-resolution T V S F seq in terms of accuracy [65]. We used frame-based approach for recognizing 16 different activities showing the efficacy of a model by demonstrating it for a high HAR accuracy score of approximately 90.99%.…”

Section: Multi-occupant Activity Recognitionmentioning

confidence: 91%

uMoDT: an unobtrusive multi-occupant detection and tracking using robust Kalman filter for real-time activity recognition

et al. 2020

View full text Add to dashboard Cite

Human activity recognition (HAR) is an important branch of human-centered research. Advances in wearable and unobtrusive technologies offer many opportunities for HAR. While much progress has been made in HAR using wearable technology, it still remains a challenging task using unobtrusive (non-wearable) sensors. This paper investigates detection and tracking of multi-occupant HAR in a smart-home environment, using a novel low-resolution Thermal Vision Sensor (TVS). Specifically, the research presents the development and implementation of a two-step framework, consisting of a Computer Vision (CV) based method to detect and track multiple occupants combined with Convolutional Neural Network (CNN) based HAR. The proposed algorithm uses frame-difference over consecutive frames for occupant detection, a set of morphological operations to refine identified objects, and features are extracted before applying a Kalman filter for tracking. Laterally, a 19-layer CNN architecture is used for HAR and afterward the results from both methods are fused using time interval based sliding window. This approach is evaluated through a series of experiments based on benchmark Thermal Infrared datasets (VOT-TIR2016) and multi-occupant data collected from TVS. Results demonstrate that the proposed framework is capable of detecting and tracking 88.46% of multi-

show abstract

Section: Multi-occupant Activity Recognitionmentioning

confidence: 91%

uMoDT: an unobtrusive multi-occupant detection and tracking using robust Kalman filter for real-time activity recognition

et al. 2020

View full text Add to dashboard Cite

show abstract

“…Action recognition, as an important research area of IoT, has a wide application prospect in daily scenes such as automatic driving, video surveillance, etc. Much works have been recently devoted to action recognition, and among them, local spatiotemporal features are shown to be successful on a variety of challenging action recognition datasets 1–3 …”

Section: Introductionmentioning

confidence: 99%

Real‐time action feature extraction via fast PCA‐Flow

Chen

et al. 2019

Concurrency and Computation

View full text Add to dashboard Cite

Summary Action recognition is a research hotspot in the field of Internet of Things (IoT). Currently, local pixel‐domain spatiotemporal feature extraction methods have reached the state‐of‐the‐art action recognition performance on many challenging datasets. However, the poor computational complexity of these approaches prevents them from scaling up to real‐time applications. For solving this problem, we present a novel real‐time video feature extraction technique by exploiting the fast PCA‐Flow algorithm. Firstly, we down‐sample video images in form of grid. Based on the down‐sampling images, PCA‐Flow algorithm is used to calculate optical flow among adjacent images. The PCA‐Flow matrices are then expanded to the original video image size by using efficient gCLSR super‐resolution method to keep the inherent geometric structure of the optical flow. Finally, we compute action descriptors based on original pixel frames and the enlarged PCA‐Flow images. The proposed approach is validated on three challenging datasets: UCF50, Hollywood2, and HMDB51. Experimental results indicate that the proposed method is more efficient in computation and can achieve competitive quality than the state‐of‐the‐art methods.

show abstract

“…CNNs have also been widely applied in video content analysis. Wang et al apply CNN networks for automatic recognition of human actions in surveillance videos.…”

mentioning

confidence: 99%

High performance deep learning techniques for big data analytics

2018

Concurrency and Computation

View full text Add to dashboard Cite

Temporal sparse feature auto‐combination deep network for video action recognition

Cited by 5 publications

References 30 publications

uMoDT: an unobtrusive multi-occupant detection and tracking using robust Kalman filter for real-time activity recognition

uMoDT: an unobtrusive multi-occupant detection and tracking using robust Kalman filter for real-time activity recognition

Real‐time action feature extraction via fast PCA‐Flow

High performance deep learning techniques for big data analytics

Contact Info

Product

Resources

About