&lt;title&gt;DANCERS: Delft advanced news retrieval system&lt;/title&gt;

Hanjalic, Alan; Kakes, Geerd; Lagendijk, Reginald L.; Biemond, J.

doi:10.1117/12.410940

Cited by 6 publications

(5 citation statements)

References 12 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Topic detection has been mostly carried out using closed caption information, embedded captions and text obtained through speech recognition, by themselves or in combination with each other (See [8,9] for example). In such approaches, text is extracted from the video using some or all of the aforementioned sources and then processed using various heuristics to extract the topic(s).…”

Section: Motivationmentioning

confidence: 99%

Video Summarization Using Mpeg-7 Motion Activity and Audio Descriptors

et al. 2003

View full text Add to dashboard Cite

We present video summarization and indexing techniques using the MPEG-7 motion activity descriptor. The descriptor can be extracted in the compressed domain and is compact, and hence is easy to extract and match. We establish that the intensity of motion activity of a video shot is a direct indication of its summarizability. We describe video summarization techniques based on sampling in the cumulative motion activity space. We then describe combinations of the motion activity based techniques with generalized sound recognition that enable completely automatic generation of news and sports video summaries. Our summarization is computationally simple and flexible, which allows rapid generation of a summary of any desired length.This work may not be copied or reproduced in whole or in part for any commercial purpose. Permission to copy in whole or in part without payment of fee is granted for nonprofit educational and research purposes provided that all such whole or partial copies include the following: a notice that such copying is by permission of Mitsubishi Electric Information Technology Center America; an acknowledgment of the authors and individual contributions to the work; and all applicable portions of the copyright notice. Copying, reproduction, or republishing for any other purpose shall require a license with payment of fee to Mitsubishi Electric Information Technology Center America. All rights reserved. {ajayd,peker,regu,zxiong,romain}@merl.com AbstractWe present video summarization and indexing techniques using the MPEG-7 motion activity descriptor. The descriptor can be extracted in the compressed domain and is compact, and hence is easy to extract and match. We establish that the intensity of motion activity of a video shot is a direct indication of its summarizability. We describe video summarization techniques based on sampling in the cumulative motion activity space. We then describe combinations of the motion activity based techniques with generalized sound recognition that enable completely automatic generation of news and sports video summaries. Our summarization is computationally simple and flexible, which allows rapid generation of a summary of any desired length.

show abstract

Section: Motivationmentioning

confidence: 99%

Video Summarization Using Mpeg-7 Motion Activity and Audio Descriptors

et al. 2003

View full text Add to dashboard Cite

show abstract

“…Similarly, the pseudo-semantic distance is defined as the norm of the difference of the shot pseudo-semantic feature vectors as (9) We shall describe how these distance measures are used for browsing in Section V.…”

Section: Shot Similaritymentioning

confidence: 99%

“…A number of systems have been proposed to solve the digital video library management problem. These include the VideoQ system [5], which allows region-based queries with motion information, the Virage video Engine [6], CueVideo from IBM [7] which uses keywords obtained from speech recognition to search through video data, a home movie library management system from Intel [8], Delft University's DANCERS news retrieval system [9] where image and audio data, and speech transcripts are used to divide news programs into report segments and assign topics to each of these and the Fischlar system [10] developed by the Dublin City University for indexing and browsing broadcast TV content. One of the largest efforts in video indexing and access is the Informedia system [11] developed by Carnegie Mellon University which reportedly has more than 2,000 hours of news and documentary programs.…”

Section: Introductionmentioning

confidence: 99%

ViBE: A Compressed Video Database Structured for Active Browsing and Search

Taskiran

Chen

Albiol

et al. 2004

IEEE Trans. Multimedia

View full text Add to dashboard Cite

In this paper, we describe a unique new paradigm for video database management known as ViBE (video indexing and browsing environment). ViBE is a browseable/searchable paradigm for organizing video data containing a large number of sequences. The system first segments video sequences into shots by using a new feature vector known as the Generalized Trace obtained from the DC-sequence of the compressed data. Each video shot is then represented by a hierarchical structure known as the shot tree. The shots are then classified into pseudo-semantic classes that describe the shot content. Finally, the results are presented to the user in an active browsing environment using a similarity pyramid data structure. The similarity pyramid allows the user to view the video database at various levels of detail. The user can also define semantic classes and reorganize the browsing environment based on relevance feedback. We describe how ViBE performs on a database of MPEG sequences.

show abstract

“…4 into a report. As an illustration, in the news segmentation approach proposed by Hanjalic et al, 35 first visual low-level features are used to detect the starting and ending points of all anchorperson shots. Several algorithms proposed in recent literature can be used for this purpose.…”

Section: Coherence Modeling: Are Visual Low-level Features Sufficient?mentioning

confidence: 99%

Recent Advances in Video Content Analysis: From Visual Features to Semantic Video Segments

Hanjalic

Lagendijk

Biemond

2001

Int. J. Image Grap.

Self Cite

View full text Add to dashboard Cite

This paper addresses the problem of automatically partitioning a video into semantic segments using visual low-level features only. Semantic segments may be understood as building content blocks of a video with a clear sequential content structure. Examples are reports in a news program, episodes in a movie, scenes of a situation comedy or topic segments of a documentary. In some video genres like news programs or documentaries, the usage of different media (visual, audio, speech, text) may be beneficial or is even unavoidable for reliably detecting the boundaries between semantic segments. In many other genres, however, the pay-off in using different media for the purpose of high-level segmentation is not high. On the one hand, relating the audio, speech or text to the semantic temporal structure of video content is generally very difficult. This is especially so in "acting" video genres like movies and situation comedies. On the other hand, the information contained in the visual stream of these video genres often seems to provide the major clue about the position of semantic segments boundaries. Partitioning a video into semantic segments can be performed by measuring the coherence of the content along neighboring video shots of a sequence. The segment boundaries are then found at places (e.g., shot boundaries) where the values of content coherence are sufficiently low. On the basis of two state-of-the-art techniques for content coherence modeling, we illustrate in this paper the current possibilities for detecting the boundaries of semantic segments using visual low-level features only.

show abstract

<title>DANCERS: Delft advanced news retrieval system</title>

Cited by 6 publications

References 12 publications

Video Summarization Using Mpeg-7 Motion Activity and Audio Descriptors

Video Summarization Using Mpeg-7 Motion Activity and Audio Descriptors

ViBE: A Compressed Video Database Structured for Active Browsing and Search

Recent Advances in Video Content Analysis: From Visual Features to Semantic Video Segments

Contact Info

Product

Resources

About