2022
DOI: 10.48550/arxiv.2207.08352
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Show Me What I Like: Detecting User-Specific Video Highlights Using Content-Based Multi-Head Attention

Uttaran Bhattacharya,
Gang Wu,
Stefano Petrangeli
et al.

Abstract: We propose a method to detect individualized highlights for users on given target videos based on their preferred highlight clips marked on previous videos they have watched. Our method explicitly leverages the contents of both the preferred clips and the target videos using pretrained features for the objects and the human activities. We design a multi-head attention mechanism to adaptively weigh the preferred clips based on their object-and humanactivity-based contents, and fuse them using these weights into… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 45 publications
0
1
0
Order By: Relevance
“…The marked multi-head attention is a hidden multihead attention layer. The mask denotes a matrix to the scaled attention scores of the multi-head attention layer, see more in Bhattacharya et al 47 MultiHead(Q, K, V) = Concat head 1 , . .…”
Section: Ship Motion Dynamics Estimation Transformermentioning
confidence: 99%
“…The marked multi-head attention is a hidden multihead attention layer. The mask denotes a matrix to the scaled attention scores of the multi-head attention layer, see more in Bhattacharya et al 47 MultiHead(Q, K, V) = Concat head 1 , . .…”
Section: Ship Motion Dynamics Estimation Transformermentioning
confidence: 99%