Video Copy Detection Using a Soft Cascade of Multimodal Features

Jiang, Menglin; Huang, Tiejun

doi:10.1109/icme.2012.189

Cited by 16 publications

(6 citation statements)

References 8 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…Poullot et al [37] introduced the Temporal Matching Kernel (TMK) that encodes sequences of frames with periodic kernels that take into account the frame descriptor and timestamp. A score function was introduced for video matching that maximizes both the similarity score and the [20] 0.962 Tian et al, 2015 [52] 0.952 Chou et al, 2015 [9] 0.938 Table 4.3: Multimodal approach and F1 score on TRECVID 2011 of four filter-andrefine matching methods. If the approach is not multimodal, then the F1 score is calculated based on the video transformations only.…”

Section: Researchmentioning

confidence: 99%

“…A sequential pyramid matching (SPM) algorithm was devised to localize the similar video sequences. In contrast, Jiang et al [20] presented a soft cascade framework utilizing multiple hashed features to filter out non-NDVs. They modified the SPM to introduce temporal information in a temporal pyramid matching (TPM).…”

Section: Filter-and-refine Matchingmentioning

confidence: 99%

“…To further improve performance, they proposed in [50] a multi-scale sequence matching method by LSH using WASF, DCT, and the dense color version SIFT (DC-SIFT), combined with TPM to match near-duplicate segments. Including the concept of transformationawareness, copy units, and soft decision boundary, Tian et al [52] extended the multimodal detector cascading framework [20], [50] to a more general approach. Chou et al [9] proposed a spatio-temporal indexing structure utilizing index patterns, termed Pattern-based Index Tree (PI-tree), to early filter non-near-duplicate videos.…”

Section: Filter-and-refine Matchingmentioning

confidence: 99%

See 2 more Smart Citations

Finding Near-Duplicate Videos in Large-Scale Collections

Kordopatis-Zilos

Papadopoulos

Patras

et al. 2019

Video Verification in the Fake News Era

View full text Add to dashboard Cite

This chapter discusses the problem of Near-Duplicate Video Retrieval (NDVR). The main objective of a typical NDVR approach is: given a query video, retrieve all near-duplicate videos in a video repository and rank them based on their similarity to the query. Several approaches have been introduced in the literature, which can be roughly classified in three categories based on the level of video matching, i.e. (i) video-level, (ii) frame-level and (iii) filter-and-refine matching. Two methods based on video-level matching are presented in this chapter. The first is an unsupervised scheme that relies on a modified Bag-of-Word (BoW) video representation. The second is a supervised method based on Deep Metric Learning (DML). For the development of both methods, features are extracted from the intermediate layers of Convolutional Neural Networks and leveraged as frame descriptors, since they offer a compact and informative image representation, and lead to increased system efficiency. Extensive evaluation has been conducted on publicly available benchmark datasets, and the presented methods are compared with state-of-art approaches, achieving the best results in all evaluation setups.

show abstract

Section: Researchmentioning

confidence: 99%

Section: Filter-and-refine Matchingmentioning

confidence: 99%

Section: Filter-and-refine Matchingmentioning

confidence: 99%

See 1 more Smart Citation

Finding Near-Duplicate Videos in Large-Scale Collections

Kordopatis-Zilos

Papadopoulos

Patras

et al. 2019

Video Verification in the Fake News Era

View full text Add to dashboard Cite

show abstract

“…多特征哈希 [1] 以及随机多角度哈希 [2] 这两种最新的基于哈希的近重复视频检索方法都是基于多特征融合的策略. 此外, Jiang 等 [13] 利用时域金字塔匹配结构融合多特征, 构建视频拷贝检测系统; Nie 等 [14,15] 由以上描述可知, 本文方法中的中间层和高层语义特征均来自于深度学习模型, 众所周知, 近年来, 相关研究者提出了很多深度卷积神经网络模型, 如 VGGNet [18] , AlexNet [19] 和 GoogLeNet [20]…”

Section: 相关工作unclassified

Hierarchical feature fusion hashing for near-duplicate video retrieval

Nie¹,

Lin²,

Yang³

et al. 2018

Sci. Sin.-Inf.

View full text Add to dashboard Cite

“…If the query was not a copy it was passed to the second layer which was the DCT detector and only declared as a copy if the DCT found a match, otherwise it was finally passed to the DCSIFT as the final layer. For details see [Jian et al 2011] and [Jiang et al 2012].…”

Section: Pku-idmmentioning

confidence: 99%

Content-Based Video Copy Detection Benchmarking at TRECVID

Awad

Over

Kraaij

2014

ACM Trans. Inf. Syst.

View full text Add to dashboard Cite

This paper presents an overview of the video copy detection benchmark which was run over a period of 4 years (2008)(2009)(2010)(2011) as part of the TREC Video Retrieval (TRECVID) workshop series. The main contributions of the paper include i) an examination of the evolving design of the evaluation framework and its components (system tasks, data, measures); ii) a high-level overview of results and best-performing approaches; and iii) a discussion of lessons learned over the four years. The content-based copy detection (CCD) benchmark worked with a large collection of synthetic queries, which is atypical for TRECVID, as was the use of a normalized detection cost framework. These particular evaluation design choices are motivated and appraised.

show abstract

Video Copy Detection Using a Soft Cascade of Multimodal Features

Cited by 16 publications

References 8 publications

Finding Near-Duplicate Videos in Large-Scale Collections

Finding Near-Duplicate Videos in Large-Scale Collections

Hierarchical feature fusion hashing for near-duplicate video retrieval

Content-Based Video Copy Detection Benchmarking at TRECVID

Contact Info

Product

Resources

About