2023
DOI: 10.3390/jimaging9070130
|View full text |Cite
|
Sign up to set email alerts
|

Human Activity Recognition Using Cascaded Dual Attention CNN and Bi-Directional GRU Framework

Abstract: Vision-based human activity recognition (HAR) has emerged as one of the essential research areas in video analytics. Over the last decade, numerous advanced deep learning algorithms have been introduced to recognize complex human actions from video streams. These deep learning algorithms have shown impressive performance for the video analytics task. However, these newly introduced methods either exclusively focus on model performance or the effectiveness of these models in terms of computational efficiency, r… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 11 publications
(2 citation statements)
references
References 114 publications
0
2
0
Order By: Relevance
“…On the one hand, these works achieved stateof-the-art accuracy for many datasets. The same occurs for [21] and [22], but using a combination of CNN and GRU. On the other hand, they do not apply to batterypowered devices that require an edge or cloud device to process this information.…”
Section: Final Remarksmentioning
confidence: 87%
See 1 more Smart Citation
“…On the one hand, these works achieved stateof-the-art accuracy for many datasets. The same occurs for [21] and [22], but using a combination of CNN and GRU. On the other hand, they do not apply to batterypowered devices that require an edge or cloud device to process this information.…”
Section: Final Remarksmentioning
confidence: 87%
“…Ullah and Munir [21] propose a dual attentional CNN (DA-CNN) architecture that leverages a unified channel-spatial attention mechanism to extract HAR features in video frames. The dual channel-spatial attention layers and the CNN layers learn to be more selective in the spatial receptive fields with objects within the feature maps.…”
Section: Hybrid Approachesmentioning
confidence: 99%