2020
DOI: 10.1109/access.2020.3027386
|View full text |Cite
|
Sign up to set email alerts
|

Temporal Segment Connection Network for Action Recognition

Abstract: Two-stream Convolutional Neural Networks have shown excellent performance in video action recognition. Most existing works train each sampling group independently, or just fuse at the last level, which obviously ignore the continuity of action in temporal and the complementary information between action fragments. In this paper, a temporal segment connection network is proposed to overcome these limitations. On the one hand, the forget gate module of the long short-term memory (LSTM) network is used to establi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
6
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 10 publications
(7 citation statements)
references
References 29 publications
0
6
0
Order By: Relevance
“…Since the video descriptor from the whole video could not be extracted, this method might not suitable for videos with various durations. Temporal segment network (TSN) [ 24 ] was designed for capturing features from the whole frame sequences with their modified two-stream networks. Unlike a traditional method, the segment consensus function was added as a post-processing step.…”
Section: Input Data For Tapg Networkmentioning
confidence: 99%
See 1 more Smart Citation
“…Since the video descriptor from the whole video could not be extracted, this method might not suitable for videos with various durations. Temporal segment network (TSN) [ 24 ] was designed for capturing features from the whole frame sequences with their modified two-stream networks. Unlike a traditional method, the segment consensus function was added as a post-processing step.…”
Section: Input Data For Tapg Networkmentioning
confidence: 99%
“…Moreover, the base module was designed specifically for RGB and optical flow from video descriptor. The two-stream network [ 24 ] was utilized to extract the rich local temporal video representation as input, to exploit the rich local behaviors within the video sequence.…”
Section: The Review Of Tapg Networkmentioning
confidence: 99%
“…Recently, many works for video classification [5], [16], [21], [23], [29], [32], [37], [42] have focused on an ability to model the temporal variation, dynamics of an action (i.e., visual tempo [42]), called temporal modeling in literature. Unlike 2D image classification, video classification should distinguish visual tempo variation as well as its semantic appearance.…”
Section: Introductionmentioning
confidence: 99%
“…In modern society, pirated goods have become a global issue as a source of illicit funds for crime organizations. [ 1 ] Some countermeasures such as physical anti‐counterfeiting technologies have been developed to prevent replication in various forms: watermarks, [ 2,3 ] intaglio printing, [ 4 ] luminescent ink, [ 5,6 ] thermal emissive label, [ 7–10 ] and magnetic ink. [ 11,12 ] Most importantly, thermal emissive labels using tailored infrared (IR) emissivity implemented using photonic structures have received significant research attention as a promising anti‐counterfeiting candidate owing to their facile design and fabrication process for a textured metal surface such as photonic crystal cavities, [ 13–16 ] nano‐antennas, [ 17–19 ] metamaterials, [ 20–22 ] and gratings.…”
Section: Introductionmentioning
confidence: 99%