Intelligent Multi‐modal Data Processing 2021
DOI: 10.1002/9781119571452.ch5

Video Summarization Using a Dense Captioning (DenseCap) Model

Cited by 1 publication (1 citation statement)
References 26 publications
“…Additionally, they employed a proposal generator primarily reliant on video features, which deviated from the core concept of the dense video captioning task. Das et al [42] proposed a model that starts by producing region captions as its primary output. These region captions are then subjected to our clustering technique, resulting in the creation of sentence clusters.…”
Section: B Multi-modal Dense Video Captioning (mentioning)
confidence: 99%