Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269)
DOI: 10.1109/icip.1998.723500
|View full text |Cite
|
Sign up to set email alerts
|

Identification of story units in audio-visual sequences by joint audio and video processing

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
21
0
1

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 32 publications
(22 citation statements)
references
References 5 publications
0
21
0
1
Order By: Relevance
“…First, a formal definition of the event can be given as a model which is then used for detection. Such approaches are usually based on rules that can be either defined by a human expert (see, e.g., Saraceno and Leonardi (1998), Tovinkere and Qian (2001), Lienhart et al (1998), Zhong and Chang (2001)) or inferred from examples as in Perlovsky (1998). Alternately, machine learning techniques can be used to train a system from examples with the goal of deciding whether a video extract contains the event or not.…”
Section: Introductionmentioning
confidence: 99%
“…First, a formal definition of the event can be given as a model which is then used for detection. Such approaches are usually based on rules that can be either defined by a human expert (see, e.g., Saraceno and Leonardi (1998), Tovinkere and Qian (2001), Lienhart et al (1998), Zhong and Chang (2001)) or inferred from examples as in Perlovsky (1998). Alternately, machine learning techniques can be used to train a system from examples with the goal of deciding whether a video extract contains the event or not.…”
Section: Introductionmentioning
confidence: 99%
“…Otherwise, these highenergy segments are checked for periodicity using an autocorrelation function. Since both voiced sounds and music may have significant peaks in their autocorrelation function, Zero Crossing Rate (ZCR) [4] of these signals is also measured. ZCR detects abrupt changes which should occur in speech signals due to existence of both voiced (low ZCR) and unvoiced (high ZCR) sounds.…”
Section: Audio Analysismentioning
confidence: 99%
“…Audio analysis is achieved based on an algorithm in [4]. According to this algorithm, an audio track is segmented into four classes as, silence, speech, music and noise.…”
Section: Audio Analysismentioning
confidence: 99%
See 2 more Smart Citations