2008 15th IEEE International Conference on Image Processing 2008
DOI: 10.1109/icip.2008.4712308
|View full text |Cite
|
Sign up to set email alerts
|

Movie summarization based on audiovisual saliency detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
26
0

Year Published

2009
2009
2016
2016

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 37 publications
(26 citation statements)
references
References 14 publications
0
26
0
Order By: Relevance
“…Motion, face, camera and audio attention models were cues to capture salient information and identify the segments to compose a summary [2]. In our previous work, saliency was modeled independently in each modality, using meaningful temporal modulations in multiple frequencies for the audio and spatiotemporal features (color, motion, intensity) for the visual stream [4,5]. An integrated audiovisual saliency curve formed the basis of a bottom-up, content-independent, summarization technique.…”
Section: Introductionmentioning
confidence: 99%
See 3 more Smart Citations
“…Motion, face, camera and audio attention models were cues to capture salient information and identify the segments to compose a summary [2]. In our previous work, saliency was modeled independently in each modality, using meaningful temporal modulations in multiple frequencies for the audio and spatiotemporal features (color, motion, intensity) for the visual stream [4,5]. An integrated audiovisual saliency curve formed the basis of a bottom-up, content-independent, summarization technique.…”
Section: Introductionmentioning
confidence: 99%
“…The segment selection and skim rendering algorithm [5], based on the multimodal saliency curve follows the steps: 1. AVT is filtered with a median filter of length 2M + 1 frames.…”
Section: Video Summarizationmentioning
confidence: 99%
See 2 more Smart Citations
“…They use motion, face, and camera attention along with audio attention models (audio saliency and speech/music) as cues to capture salient information and identify the audio and video segments to compose the summary. Rapantzikos et al (Evangelopoulos et al 2008(Evangelopoulos et al , 2009) build further on visual, audio, and textual attention models for visual summarization. The authors form a multimodal saliency curve integrating the aural, visual, and textual streams of videos based on efficient audio, image, and language processing and employ it as a metric for video event detection and abstraction.…”
Section: Novelty Detection and Video Summarizationmentioning
confidence: 99%