2017
DOI: 10.48550/arxiv.1708.03805
Preprint

Revisiting the Effectiveness of Off-the-shelf Temporal Modeling Approaches for Large-scale Video Classification

Cited by 34 publications (28 citation statements)
References 8 publications
“…The change in both training and validation sets generates a small discrepancy between experiments conducted at different times. We explicitly denote results on the original Kinetics dataset with an asterisk (*) in all tables and provide the list of videos available at the time of our experiments to enable others to reproduce our results 1 . HMDB-51 and UCF-101.…”
Section: Datasets (mentioning)
confidence: 99%
“…Kinetics-400
ARTNet [33], RGB+Flow: 72.4*
TSN [30], RGB+Flow: 73.9*
R(2+1)D [31], RGB+Flow: 75.4*
NL I3D [34], RGB: 77.7*
SAN [1], RGB+Flow+Audio: 77.7*
I3D [3], RGB: 70.6 / 71.1*
I3D [3], Flow: 62.1 / 63.9*
I3D [3], RGB+Flow: 72.6 / 74.1*
S3D-G [35], RGB: 74.0 / 74.7*
S3D-G [35], Flow: 67.3 / 68.0*
S3D-G [35], RGB+Flow: 76.2 / 77.2*
D3D, RGB: 75.9
D3D+S3D-G, RGB+RGB: 76.5…”
Section: Modality (mentioning)
confidence: 99%
“…A recent research topic is to estimate optical flow by CNNs [8,35,31,18,26,4]. These approaches cast the optical flow estimation as an optimization problem with respect to the CNN parameters.…”
Section: Related Work (mentioning)
confidence: 99%
“…Top-5
blVNet [18]: 73.5, 91.2, -, -
STM [31]: 73.7, 91.6, -, -
TEA [41]: 76.1, 92.5, -, -
TS S3D-G [60]: 77.2, 93.0, -, -
3-stream SATT [8]: 77.7, 93.2, -, -
AVSlowFast, R101 [59]: 78.8, 93.6, 85.0†, -
LGD-3D R101 [48]: 79.4, 94.4, -, -
SlowFast R101-NL [20]: 79.8, 93.9, -, -
ViViT-Base [6]: 80
[32] and Kinetics Sound [4]. We report top-1 and top-5 classification accuracy.…”
Section: Moments In Time (mentioning)
confidence: 99%