2022
DOI: 10.3390/math10183290
|View full text |Cite
|
Sign up to set email alerts
|

STSM: Spatio-Temporal Shift Module for Efficient Action Recognition

Abstract: The modeling, computational complexity, and accuracy of spatio-temporal models are the three major foci in the field of video action recognition. The traditional 2D convolution has low computational complexity, but it cannot capture the temporal relationships. Although the 3D convolution can obtain good performance, it is with both high computational complexity and a large number of parameters. In this paper, we propose a plug-and-play Spatio-Temporal Shift Module (STSM), which is a both effective and high-per… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(2 citation statements)
references
References 44 publications
0
2
0
Order By: Relevance
“…Lin et al [27] proposed a time shift module for hardware efficient video recognition, which moves part of the channel along the time dimension to exchange information with adjacent frames. Yang et al [28] proposed a spatial-temporal displacement module for efficient video recognition. This module moves some channels in the time dimension and space dimension of different channels, enabling the network to learn its spatial-temporal characteristics.…”
Section: B Time Module Pluginmentioning
confidence: 99%
“…Lin et al [27] proposed a time shift module for hardware efficient video recognition, which moves part of the channel along the time dimension to exchange information with adjacent frames. Yang et al [28] proposed a spatial-temporal displacement module for efficient video recognition. This module moves some channels in the time dimension and space dimension of different channels, enabling the network to learn its spatial-temporal characteristics.…”
Section: B Time Module Pluginmentioning
confidence: 99%
“…This module can be inserted in existing 2D CNNs to achieve time modeling of zero computation and zero parameters. Although TSM was widely used in tasks such as video classification and action recognition [ 22 , 23 ], no researcher applied TSM to the field of pig farming. In this study, we aimed to insert TSM into four widely used 2D CNN models, which enhance the model’s learning ability on time features, while maintaining the model’s performance in handling spatial features.…”
Section: Introductionmentioning
confidence: 99%