2001
DOI: 10.1117/12.410940
|View full text |Cite
|
Sign up to set email alerts
|

<title>DANCERS: Delft advanced news retrieval system</title>

Abstract: In this paper we present a system for automated analysis, classification and indexing of broadcast news programs. The system first analyzes the visual and the speech stream of an input news program in order to obtain an initial partitioning of the program into the so-called report segments. The analysis of the visual stream provides the boundaries of the report segments lying at the beginning and the end of each anchorperson shot. This analysis step is performed by applying an existing technique for anchorpers… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2001
2001
2006
2006

Publication Types

Select...
4
1
1

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(5 citation statements)
references
References 12 publications
0
5
0
Order By: Relevance
“…Topic detection has been mostly carried out using closed caption information, embedded captions and text obtained through speech recognition, by themselves or in combination with each other (See [8,9] for example). In such approaches, text is extracted from the video using some or all of the aforementioned sources and then processed using various heuristics to extract the topic(s).…”
Section: Motivationmentioning
confidence: 99%
“…Topic detection has been mostly carried out using closed caption information, embedded captions and text obtained through speech recognition, by themselves or in combination with each other (See [8,9] for example). In such approaches, text is extracted from the video using some or all of the aforementioned sources and then processed using various heuristics to extract the topic(s).…”
Section: Motivationmentioning
confidence: 99%
“…Similarly, the pseudo-semantic distance is defined as the norm of the difference of the shot pseudo-semantic feature vectors as (9) We shall describe how these distance measures are used for browsing in Section V.…”
Section: Shot Similaritymentioning
confidence: 99%
“…A number of systems have been proposed to solve the digital video library management problem. These include the VideoQ system [5], which allows region-based queries with motion information, the Virage video Engine [6], CueVideo from IBM [7] which uses keywords obtained from speech recognition to search through video data, a home movie library management system from Intel [8], Delft University's DANCERS news retrieval system [9] where image and audio data, and speech transcripts are used to divide news programs into report segments and assign topics to each of these and the Fischlar system [10] developed by the Dublin City University for indexing and browsing broadcast TV content. One of the largest efforts in video indexing and access is the Informedia system [11] developed by Carnegie Mellon University which reportedly has more than 2,000 hours of news and documentary programs.…”
Section: Introductionmentioning
confidence: 99%
“…4 into a report. As an illustration, in the news segmentation approach proposed by Hanjalic et al, 35 first visual low-level features are used to detect the starting and ending points of all anchorperson shots. Several algorithms proposed in recent literature can be used for this purpose.…”
Section: Coherence Modeling: Are Visual Low-level Features Sufficient?mentioning
confidence: 99%