Proceedings of the 29th ACM International Conference on Multimedia 2021
DOI: 10.1145/3474085.3475549
|View full text |Cite
|
Sign up to set email alerts
|

Video Similarity and Alignment Learning on Partial Video Copy Detection

Abstract: Existing video copy detection methods generally measure video similarity based on spatial similarities between key frames, neglecting the latent similarity in temporal dimension, so that the video similarity is biased towards spatial information. There are methods modeling unified video similarity in an end-to-end way, but losing detailed partial alignment information, which causes the incapability of copy segments localization. To address the above issues, we propose the Video Similarity and Alignment Learnin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 18 publications
(11 citation statements)
references
References 27 publications
0
11
0
Order By: Relevance
“…We have also removed the optimization components for a strongest accuracy. The F 1 score has been used as it is common to characterize the PVCD methods [9,4,19,6,5,15].…”
Section: Performance Evaluationmentioning
confidence: 99%
See 1 more Smart Citation
“…We have also removed the optimization components for a strongest accuracy. The F 1 score has been used as it is common to characterize the PVCD methods [9,4,19,6,5,15].…”
Section: Performance Evaluationmentioning
confidence: 99%
“…It is a well-known topic in the computer vision field [12]. The recent works investigate the detection methods robust to the spatial & temporal deformations [6,5,15] or real-time [4,13,19]. A key aspect for any computer vision task is to design public datasets for performance evaluation.…”
Section: Introductionmentioning
confidence: 99%
“…Previous segment-level evaluation metrics are introduced with MUSCLE-VCD [15] and VCDB datasets [11]. Most of recent research works [7][8][9] adopt segment precision and recall defined in VCDB as follows:…”
Section: Datasets and Evaluationmentioning
confidence: 99%
“…In most cases, video-level copy detection results alone are not sufficient as the detected videos are usually displayed and interacted with system users for downstream tasks. Hence, designing an approach that can locate the copied segments is preferred and has already attracted lots of attentions in recent works [7][8][9][10][11].…”
Section: Introductionmentioning
confidence: 99%
“…Kordopatis et al [10] calculate video-to-video similarity by refined frame-to-frame similarity matrices. Han et.al [11] modelled the video similarity as the mask map predicted from frame-level spatial similarity.…”
Section: Related Workmentioning
confidence: 99%