2017 IEEE Winter Conference on Applications of Computer Vision (WACV) 2017
DOI: 10.1109/wacv.2017.115
|View full text |Cite
|
Sign up to set email alerts
|

Semantic Text Summarization of Long Videos

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
30
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 39 publications
(30 citation statements)
references
References 22 publications
0
30
0
Order By: Relevance
“…(5) Semantic-Sum [19]: a recent method that also identifies the video segments as ours. We find that this method gets best performance when setting sentence length as 3 and using Latent Semantic Analysis [21] in summarization module.…”
Section: Comparison With Other Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…(5) Semantic-Sum [19]: a recent method that also identifies the video segments as ours. We find that this method gets best performance when setting sentence length as 3 and using Latent Semantic Analysis [21] in summarization module.…”
Section: Comparison With Other Methodsmentioning
confidence: 99%
“…(1) When generating each sentence of the paragraph, the method in [29] requires the features from the entire video, which is expensive for very long videos, while our method only requires features in selected proposals. (2) In [19], the clips are selected according to frame quality in advance as a preprocessing step, without taking into account the coherence of narration. This way will lead to redundancy in the resulting paragraph.…”
Section: Related Workmentioning
confidence: 99%
“…Here the summary is based on sentence to centroid score, cue phrase score, sentence position score, numerical data and tf-idf score. Textual summaries of long videos (Shagan et al, 2017) are generated using recurrent networks where key frames are taken from impactful segments and are converted to textual annotations. The sequence of events in the video are summarized to generate a paragraph description.…”
Section: General Methodsmentioning
confidence: 99%
“…In automotive or indoor robotic visual perception problems, simple concatenation techniques perform well but they fall short in some applications like video captioning [10,33] or summarization [42] where long term dependencies are required. LSTMs in such cases offer a better alternative [59,45].…”
Section: Feature Aggregationmentioning
confidence: 99%