2021
DOI: 10.1609/aaai.v35i15.17607
|View full text |Cite
|
Sign up to set email alerts
|

Movie Summarization via Sparse Graph Construction

Abstract: We summarize full-length movies by creating shorter videos containing their most informative scenes. We explore the hypothesis that a summary can be created by assembling scenes which are turning points (TPs), i.e., key events in a movie that describe its storyline. We propose a model that identifies TP scenes by building a sparse movie graph that represents relations between scenes and is constructed using multimodal information. According to human judges, the summaries created by our approach are more inform… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 12 publications
(3 citation statements)
references
References 39 publications
0
3
0
Order By: Relevance
“…We segment episodes into shots (using PySceneDetect 4 ) and map these to utterances in the corresponding transcript. Specifically, we align the closed captions in the video which are timestamped to the utterances in the transcript using Dynamic Time Warping (DTW; Myers and Rabiner 1981; Papalampidi et al 2021b). We thus create a one-to-many alignment where an utterance corresponds to one or more shots.…”
Section: Multimodal Augmentationmentioning
confidence: 99%
See 2 more Smart Citations
“…We segment episodes into shots (using PySceneDetect 4 ) and map these to utterances in the corresponding transcript. Specifically, we align the closed captions in the video which are timestamped to the utterances in the transcript using Dynamic Time Warping (DTW; Myers and Rabiner 1981; Papalampidi et al 2021b). We thus create a one-to-many alignment where an utterance corresponds to one or more shots.…”
Section: Multimodal Augmentationmentioning
confidence: 99%
“…Previous work on video-to-video summarization identifies highlights from YouTube videos, TV shows, or movies (Song et al, 2015;Gygli et al, 2014;De Avila et al, 2011;Papalampidi et al, 2021b). However, in most cases, either the videos are short or the datasets are small with a few hundred examples.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation