The importance of Information Retrieval (IR) in audio-visual recordings has been increasing with the steeply growing number of audio-visual documents available on-line. Compared to traditional IR methods, this task requires specific techniques, such as Passage Retrieval, which can accelerate the search process by retrieving the exact relevant passage of a recording instead of the full document. In Passage Retrieval, full recordings are divided into shorter segments which serve as individual documents for the subsequent IR setup. This technique also allows normalizing document length and exploiting positional information. It has been shown that it can even improve retrieval results (e.g. [3]).

In this work, we examine two general strategies for Passage Retrieval: blind segmentation into overlapping regular-length passages, and segmentation into variable-length passages based on the semantics of their content.

Time-based segmentation has already been shown to improve retrieval of textual documents and audio-visual recordings (e.g. [3,5]). Our experiments, performed on the test collection used in the Search subtask of the Search and Hyperlinking Task in MediaEval Benchmarking 2012, confirm those findings and show that tuning the parameters (segment length and shift) for a specific test collection can further improve the results. Our best results on this collection were achieved with 45-second segments and 15-second shifts.

Semantic-based segmentation can be divided into three types: similarity-based (producing segments with high intra-similarity and low inter-similarity), lexical-chain-based (producing segments with frequent lexically connected words), and feature-based (combining various features which signal a segment break in a machine-learning setting) [4].
In this work, we focus mainly on feature-based segmentation, which allows exploiting various features from all modalities of the data (including segment length) in a single trainable model and produces segments which may overlap.
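For illustration, the blind time-based strategy with the best-performing parameters above (45-second segments, 15-second shifts) can be sketched as follows; the function name and interface are our own and purely illustrative, not part of the evaluated system:

```python
def blind_segments(duration, seg_len=45.0, shift=15.0):
    """Cut a recording of `duration` seconds into overlapping,
    regular-length passages, returned as (start, end) pairs."""
    segments = []
    start = 0.0
    while start < duration:
        # The final windows are clipped to the end of the recording.
        segments.append((start, min(start + seg_len, duration)))
        start += shift
    return segments

# A 90-second recording yields passages starting at 0, 15, ..., 75 s.
segments = blind_segments(90.0)
```

With a 45-second window and a 15-second shift, each point of the recording is covered by up to three passages, so a relevant region is unlikely to be cut in half by every segment boundary.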
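The feature-based idea can be sketched minimally as follows: candidate boundaries between transcript units are scored by a weighted combination of multimodal features, and a break is placed where the score exceeds a threshold. The feature names, weights, and threshold below are purely illustrative assumptions (in practice the weights would be learned by the trainable model), not those of our system:

```python
def boundary_score(features, weights):
    """Linear combination of boundary features; a stand-in for
    a learned model that scores candidate segment breaks."""
    return sum(weights[name] * value for name, value in features.items())

# Hypothetical features for one candidate break between two units:
# silence duration, speaker change, and a drop in lexical cohesion.
candidate = {"pause_sec": 1.8, "speaker_change": 1.0, "cohesion_drop": 0.6}
weights = {"pause_sec": 0.5, "speaker_change": 1.0, "cohesion_drop": 2.0}

score = boundary_score(candidate, weights)
is_break = score > 2.5  # illustrative decision threshold
```

Because each candidate is scored independently, nothing prevents the resulting segments from overlapping, unlike a strict linear partition of the transcript.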