2021
DOI: 10.1109/tip.2020.3036749
|View full text |Cite
|
Sign up to set email alerts
|

A Spatial-Temporal Recurrent Neural Network for Video Saliency Prediction

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
8
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 20 publications
(9 citation statements)
references
References 68 publications
1
8
0
Order By: Relevance
“…STRA-Net [11] adopts a dual-pathway architecture combining 2D ResNet50 with ConvLSTM, while the proposed STCED utilizes a dualpathway 3D ResNet50 as the encoder, which implicitly justifies the capability of 3DCNN. As shown in Table 3, all of these four models [11], [13]- [15] perform far behind STCED on the DHF1K test set, which verifies the effectiveness of the proposed model. 1 https://mmcheng.net/videosal/…”
Section: E Comparison With the State-of-the-artsupporting
confidence: 53%
See 4 more Smart Citations
“…STRA-Net [11] adopts a dual-pathway architecture combining 2D ResNet50 with ConvLSTM, while the proposed STCED utilizes a dualpathway 3D ResNet50 as the encoder, which implicitly justifies the capability of 3DCNN. As shown in Table 3, all of these four models [11], [13]- [15] perform far behind STCED on the DHF1K test set, which verifies the effectiveness of the proposed model. 1 https://mmcheng.net/videosal/…”
Section: E Comparison With the State-of-the-artsupporting
confidence: 53%
“…The network uses two tightly coupled streams to extract appearance and motion features, and then a lightweight ConvGRU, an alternative to ConvLSTM, to model long-term temporal dependence. Zhang et al [15] propose a select and reweight fusion module to automatically weight spatial and temporal features from different domains to enhance the meaningful features and decrease less useful ones and integrate them. In order to consider interframe motion cues, they design an attentionaware ConvLSTM to predict the final salient region based on integrated features.…”
Section: B Modern Dynamic Saliency Modelsmentioning
confidence: 99%
See 3 more Smart Citations