2018
DOI: 10.1007/978-3-030-01264-9_24

Pivot Correlational Neural Network for Multimodal Video Categorization

Cited by 12 publications (12 citation statements)
References 17 publications
“…The proposed approach is evaluated on FCVID using the mean average precision (mAP) and compared against the top-scoring approaches of the literature, i.e. PivotCorrNN [15], LiteEval [30], AdaFrame [31], SCSampler [17], ST-VLAD [22] and AR-Net [19]. On YLI-MED, the top-1 accuracy is utilized, and the comparison is performed against the top-scoring literature approaches for this dataset, i.e.…”
Section: Results (mentioning)
confidence: 99%
“…Spatiotemporal VLAD (ST-VLAD) is presented in [22], encoding convolutional features across different segments to represent the video. In [15], PivotCorrNN is proposed, exploiting correlations among different video modalities. S2L is introduced in [32], utilizing a pretrained ResNet and an LSTM to model separately the spatial and temporal video information.…”
Section: Related Work (mentioning)
confidence: 99%
“…We can find that our model achieves higher event recognition performance compared with some Appearance-based methods on both datasets. Besides, our model is also better than Pivot CorrNN [19], which uses seven types of pre-extracted features to perform event recognition.…”
Section: Comparison to State of the Art (mentioning)
confidence: 97%