2005
DOI: 10.1007/11526346_32
Dialogue Sequence Detection in Movies

Citations: cited by 20 publications (30 citation statements)
References: 8 publications
“…The event segmentation and classification technique used here is described in detail and evaluated in [29,30,12]. It exploits a number of observations about film creation principles.…”
Section: Figure 2: Overall Approach (mentioning)
confidence: 99%
“…Once the shot boundaries are known, the editing pace can be inferred. Two motion features are extracted per shot, MPEG-7 motion intensity [33] and a measure of camera motion [29]. A support vector machine based classifier is used to classify audio into one of: speech, music, silence and other audio.…”
Section: Figure 2: Overall Approach (mentioning)
confidence: 99%
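
The statement above says that editing pace can be inferred once shot boundaries are known, but the excerpt does not give a formula. As a minimal sketch only, assuming editing pace is measured as shot length and cuts per minute (an assumption, not the cited authors' definition), it could look like this. The function names, the frame-index representation of boundaries, and the 25 fps default are all hypothetical.

```python
def shot_lengths(boundaries, total_frames):
    """Return the length in frames of each shot, given cut positions.

    `boundaries` holds the frame index at which each new shot starts
    (hypothetical representation; the source does not specify one).
    """
    edges = [0] + sorted(boundaries) + [total_frames]
    return [end - start for start, end in zip(edges, edges[1:])]


def cuts_per_minute(boundaries, total_frames, fps=25.0):
    """Editing pace expressed as the number of cuts per minute of footage."""
    minutes = total_frames / fps / 60.0
    return len(boundaries) / minutes


# One minute of footage at 25 fps with three cuts.
lengths = shot_lengths([250, 400, 1000], total_frames=1500)
pace = cuts_per_minute([250, 400, 1000], total_frames=1500, fps=25.0)
```

Shorter shots and a higher cut rate indicate faster editing; the motion features named in the statement are not covered by this sketch.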
“…As a starting point, four low-level audio features are extracted, namely the High Zero Crossing Rate Ratio, the Silence Ratio, the Short Term Energy, and the Short-Term Energy Variation. The effectiveness of these low-level features in helping to distinguish between speech and music has previously been demonstrated [11,18]. In order to classify each one second window of audio, a set of support vector machines (SVMs) are used, one for each audio category.…”
Section: Audio-visual Analysis Techniques Used (mentioning)
confidence: 99%
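
The four low-level features named in the statement above can be illustrated with a short sketch. The excerpt names the features but does not define them, so the definitions here follow common usage and are assumptions: the 1.5x mean threshold for a "high" zero-crossing rate, the fixed energy threshold for silence, the use of variance as the energy variation, and the frame length are all hypothetical choices. The SVM classification step is not shown.

```python
import math


def split_frames(samples, frame_len):
    """Cut a one-second window into non-overlapping frames."""
    last_start = len(samples) - frame_len
    return [samples[i:i + frame_len] for i in range(0, last_start + 1, frame_len)]


def short_term_energy(frame):
    """Mean squared amplitude of one frame."""
    return sum(x * x for x in frame) / len(frame)


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)


def window_features(samples, frame_len, silence_threshold=1e-4, zcr_factor=1.5):
    """Compute four features for one window (thresholds are assumptions)."""
    frame_list = split_frames(samples, frame_len)
    energies = [short_term_energy(f) for f in frame_list]
    zcrs = [zero_crossing_rate(f) for f in frame_list]
    mean_zcr = sum(zcrs) / len(zcrs)
    mean_energy = sum(energies) / len(energies)
    return {
        # Share of frames whose ZCR exceeds zcr_factor times the window mean.
        "hzcrr": sum(z > zcr_factor * mean_zcr for z in zcrs) / len(zcrs),
        # Share of frames whose energy falls below the silence threshold.
        "silence_ratio": sum(e < silence_threshold for e in energies) / len(energies),
        "short_term_energy": mean_energy,
        # Variance of frame energies, used here as the energy variation.
        "energy_variation": sum((e - mean_energy) ** 2 for e in energies) / len(energies),
    }


# Synthetic one-second window at 8 kHz: half silence, half a 440 Hz tone.
rate = 8000
window = [0.0] * (rate // 2) + [
    math.sin(2 * math.pi * 440 * n / rate) for n in range(rate // 2)
]
features = window_features(window, frame_len=200)
```

In a full pipeline, a feature vector like this would be passed to one classifier per audio category, as the statement describes.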
“…The values obtained from processing each one second window are then up-sampled in order to compute the proportion of each audio category for each shot. At the end of this process, for each shot of a movie, there is a value for the percentage of speech, music, silence, quiet music, speech with background music and other audio present (further details of the audio classification process are provided in [18]). …”
Section: Audio-visual Analysis Techniques Used (mentioning)
confidence: 99%
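
The per-shot aggregation described in the statement above can be sketched as follows. This is an illustration under stated assumptions, not the cited implementation: the per-second labels are up-sampled by repeating each label for every frame in that second, shots are given as hypothetical (start, end) frame ranges with an exclusive end, and 25 fps is an assumed frame rate.

```python
from collections import Counter


def per_shot_proportions(second_labels, shots, fps=25):
    """Return, for each shot, the proportion of frames in each audio class.

    Assumes every shot contains at least one labelled frame.
    """
    # Up-sample: repeat each one-second label once per frame in that second.
    frame_labels = [label for label in second_labels for _ in range(fps)]
    result = []
    for start, end in shots:  # end index is exclusive
        counts = Counter(frame_labels[start:end])
        total = sum(counts.values())
        result.append({label: n / total for label, n in counts.items()})
    return result


# Four seconds of audio labels and two shots that do not align to seconds.
labels = ["speech", "speech", "music", "silence"]
shots = [(0, 60), (60, 100)]
proportions = per_shot_proportions(labels, shots, fps=25)
```

Up-sampling to frame resolution lets shots that start or end mid-second receive a fractional share of that second's label.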