2016
DOI: 10.1109/tmm.2016.2559947
|View full text |Cite
|
Sign up to set email alerts
|

TagBook: A Semantic Video Representation Without Supervision for Event Detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
18
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
9
1

Relationship

0
10

Authors

Journals

citations
Cited by 38 publications
(18 citation statements)
references
References 54 publications
0
18
0
Order By: Relevance
“…is raises new challenges in searching both within and across videos. e problem of making videos content more accessible has spurred research in automatic tagging [2,39,51] and video summarization [1,15,26,27,31,36,49,57,69]. In automatic tagging, the goal is to predict meta-data in form of tags, which makes videos searchable via text queries.…”
Section: Introductionmentioning
confidence: 99%
“…is raises new challenges in searching both within and across videos. e problem of making videos content more accessible has spurred research in automatic tagging [2,39,51] and video summarization [1,15,26,27,31,36,49,57,69]. In automatic tagging, the goal is to predict meta-data in form of tags, which makes videos searchable via text queries.…”
Section: Introductionmentioning
confidence: 99%
“…Merler et al [25] and Ma et al [26] utilize external images and videos to build an intermediate level video representation for event detection. Mazloom et al [27] learn a video descriptor based on the tags of their nearest neighbors in a large collection of social tagged videos. Song et al [28] extract key segments for event detection by transferring concept knowledge from web images and videos.…”
Section: A Related Workmentioning
confidence: 99%
“…For instance, [12], [61] conduct unsupervised representation learning by reinforcing the visual representations generated from hand-craft features through the use of freely available social tags or text descriptions of web videos. Neural networks are used to construct unsupervised feature representations via auto-encoders [62], [63] and restricted Boltzmann machines (RBMs) [64].…”
Section: B Representation Learningmentioning
confidence: 99%