2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221)
DOI: 10.1109/icassp.2001.941190
|View full text |Cite
|
Sign up to set email alerts
|

New approaches to audio-visual segmentation of TV news for automatic topic retrieval

Abstract: This paper presents two new real-time approaches to segmentation of TV news shows into topics. The goal of this research work is the high precision retrieval of topics from TV news. For that purpose, the detection of correct topic boundaries is of great importance. We introduce a stochastic and a rule-based topic model based on HMMs. The former combines features from the visual as well as from the audio channel of the news show, whereas the latter uses the video channel only. They are compared to the detection… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0
1

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 8 publications
(9 citation statements)
references
References 2 publications
(3 reference statements)
0
8
0
1
Order By: Relevance
“…The internal structure of video has been modeled in literature to facilitate the detection of logical unit boundaries. Iurgel [14] proposed a video model for news shows, part of which is shown in Fig. 9.…”
Section: Segmentation Mechanismsmentioning
confidence: 99%
See 3 more Smart Citations
“…The internal structure of video has been modeled in literature to facilitate the detection of logical unit boundaries. Iurgel [14] proposed a video model for news shows, part of which is shown in Fig. 9.…”
Section: Segmentation Mechanismsmentioning
confidence: 99%
“…They consist of topic units with compilation and continuity cutting. Specific news models have been used to capture this type of structure [14,24]. Training and instructional videos focus on teaching and often an instructor is audibly and visibly addressing the audience of the video or interacting with an audience visible in the video.…”
Section: Data Domain and Logical Unit Typementioning
confidence: 99%
See 2 more Smart Citations
“…Esse fato pode ser explicado pela capacidade da parte visual de transmitir uma grande parte da semântica latente presente em um vídeo, comprovada em vários trabalhos (Fabro e Böszörmenyi, 2013;Coimbra, 2011;Iurgel et al, 2001). …”
Section: Segmentação Em Cenas Com Descritores Visuaisunclassified