The platform will undergo maintenance on Sep 14 at about 7:45 AM EST and will be unavailable for approximately 2 hours.
Proceedings of the ACM International Conference on Image and Video Retrieval 2009
DOI: 10.1145/1646396.1646439
|View full text |Cite
|
Sign up to set email alerts
|

Movie segmentation into scenes and chapters using locally weighted bag of visual words

Abstract: Movies segmentation into semantically correlated units is a quite tedious task due to "semantic gap". Low-level features do not provide useful information about the semantical correlation between shots and usually fail to detect scenes with constantly dynamic content. In the method we propose herein, local invariant descriptors are used to represent the key-frames of video shots and a visual vocabulary is created from these descriptors resulting to a visual words histogram representation (bag of visual words) … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
24
0
17

Year Published

2011
2011
2021
2021

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 28 publications
(41 citation statements)
references
References 11 publications
0
24
0
17
Order By: Relevance
“…Such a result is quite competitive with the state of the art techniques introduced in (Rasheed et al, 2005), (Chasanis et al, 2009), (Zhu et al, 2009) which yield precision/recall rates varying between 65% and 72%.…”
Section: Figure 13 [Detected Scenes]mentioning
confidence: 93%
See 1 more Smart Citation
“…Such a result is quite competitive with the state of the art techniques introduced in (Rasheed et al, 2005), (Chasanis et al, 2009), (Zhu et al, 2009) which yield precision/recall rates varying between 65% and 72%.…”
Section: Figure 13 [Detected Scenes]mentioning
confidence: 93%
“…The validation of our scene extraction method has been performed on a corpus of 6 sitcoms and 6 Hollywood movies (Tables 6 and 7) also used for evaluation purposes in the state of the art algorithms presented in (Rasheed et al, 2005), (Chasanis et al, 2009), (Zhu et al, 2009). Fig.…”
Section: Scene/dvd Chapter Extractionmentioning
confidence: 99%
“…More recent techniques (Chasanis et al, 2009), (Zhu et al, 2009) introduce in the analysis process useful concepts such as temporal constraints and visual similarity.…”
Section: Related Workmentioning
confidence: 99%
“…Concerning the keyframe visual similarity involved in the above-described process, we have considered two different approaches, based on (1) chi-square distance between HSV color histograms, and (2) the number of matched interest points determined based on SIFT descriptors with a Kd-tree matching technique [20].…”
Section: Scene Segmentationmentioning
confidence: 99%