2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition 2018
DOI: 10.1109/cvpr.2018.00773
|View full text |Cite
|
Sign up to set email alerts
|

HSA-RNN: Hierarchical Structure-Adaptive RNN for Video Summarization

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
100
0
2

Year Published

2019
2019
2021
2021

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 182 publications
(102 citation statements)
references
References 21 publications
0
100
0
2
Order By: Relevance
“…A typical result of these approaches is a sequence of keyframes or a video excerpt comprising the most important parts of a video. More recent methods treat video summarization as an optimization problem [7,10,34] or they utilize recurrent neural networks [35,36] based on, for instance, long short-term memory cells (LSTMs), which are able to capture temporal or sequential information very well. Another use case for LSTMs is proposed by Mahasseni et al [22], who suggest a generative adversarial network (GAN) consisting of an LSTM-based autoencoder and a discriminator.…”
Section: Related Workmentioning
confidence: 99%
“…A typical result of these approaches is a sequence of keyframes or a video excerpt comprising the most important parts of a video. More recent methods treat video summarization as an optimization problem [7,10,34] or they utilize recurrent neural networks [35,36] based on, for instance, long short-term memory cells (LSTMs), which are able to capture temporal or sequential information very well. Another use case for LSTMs is proposed by Mahasseni et al [22], who suggest a generative adversarial network (GAN) consisting of an LSTM-based autoencoder and a discriminator.…”
Section: Related Workmentioning
confidence: 99%
“…Unsupervised video summarization methods [5], [6] usually use manually defined criteria to extract key frames or key shots. While supervised ones [7], [8] learn models with the help of human-annotated data to determine which frames or shots are more important. In this paper, we mainly focus on supervised ones.…”
Section: Introductionmentioning
confidence: 99%
“…Two kinds of methods are designed to avoid browsing the whole video. The first kind is video summarization methods [32,58], which generate a short synopsis for a long video. The second kind of methods [7,8,13,14,19,22,31,37,41] try to trim the video segment of interest.…”
Section: Introductionmentioning
confidence: 99%