2023
DOI: 10.3389/fnbot.2023.1050167
|View full text |Cite
|
Sign up to set email alerts
|

3D network with channel excitation and knowledge distillation for action recognition

Abstract: Modern action recognition techniques frequently employ two networks: the spatial stream, which accepts input from RGB frames, and the temporal stream, which accepts input from optical flow. Recent researches use 3D convolutional neural networks that employ spatiotemporal filters on both streams. Although mixing flow with RGB enhances performance, correct optical flow computation is expensive and adds delay to action recognition. In this study, we present a method for training a 3D CNN using RGB frames that rep… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 45 publications
(50 reference statements)
0
1
0
Order By: Relevance
“…During the training process, the student model attempts to replicate the predictions of the teacher model in order to transfer its knowledge. This method can reduce the size of the model, thereby improving its efficiency (Lan et al, 2022 ; Hu et al, 2023 ). Model pruning (Han et al, 2015 ) is a technique of compressing the model's size by removing unimportant neurons or connections.…”
Section: Related Studiesmentioning
confidence: 99%
“…During the training process, the student model attempts to replicate the predictions of the teacher model in order to transfer its knowledge. This method can reduce the size of the model, thereby improving its efficiency (Lan et al, 2022 ; Hu et al, 2023 ). Model pruning (Han et al, 2015 ) is a technique of compressing the model's size by removing unimportant neurons or connections.…”
Section: Related Studiesmentioning
confidence: 99%