Human Activity Recognition Using Cascaded Dual Attention CNN and Bi-Directional GRU Framework

Ullah, Hayat; Munir, Arslan

doi:10.3390/jimaging9070130

Cited by 11 publications

(2 citation statements)

References 114 publications

“…On the one hand, these works achieved state-of-the-art accuracy for many datasets. The same occurs for [21] and [22], but using a combination of CNN and GRU. On the other hand, they do not apply to battery-powered devices that require an edge or cloud device to process this information.…”

Section: Final Remarks (mentioning)

confidence: 87%

Deploying Human Activity Recognition in Embedded RISC-V Processors

Nunes, Reusch, Luza et al. 2024

Preprint

Human Activity Recognition (HAR) is an important area of research due to its applications in health monitoring, elderly care, and personal fitness tracking. The challenge is deploying efficient and accurate HAR systems on resource-constrained embedded devices, which require low power consumption and processing efficiency. This work optimizes a Convolutional Neural Network (CNN) model for HAR, targeting resource-constrained processors. The goal is to balance accuracy, performance, and power consumption for real-world deployment in wearable devices. Key contributions include introducing an Extended 1D CNN model that enhances temporal awareness and accuracy without the overhead of floating-point computations, evaluating and applying quantization methods to minimize model size with minimal accuracy loss, and assessing the model's performance on a RISC-V processor. Results show an accuracy increase from 74% to 87.2%. Memory optimization using Lookup Table (LUT) quantization reduces the memory required for model parameters by 57%. This research underscores the potential for advanced neural network models on low-power RISC-V processors in real-time HAR, with significant implications for health monitoring and smart environments.

“…Ullah and Munir [21] propose a dual attentional CNN (DA-CNN) architecture that leverages a unified channel-spatial attention mechanism to extract HAR features in video frames. The dual channel-spatial attention layers and the CNN layers learn to be more selective in the spatial receptive fields with objects within the feature maps.…”

Section: Hybrid Approaches (mentioning)

confidence: 99%