2019
DOI: 10.1145/3355398
|View full text |Cite
|
Sign up to set email alerts
|

Survey of Compressed Domain Video Summarization Techniques

Abstract: Video summarization is the method of extracting key frames or clips from a video to generate a synopsis of the content of the video. Generally, video is compressed before storing or transmitting it in most of the practical applications. Traditional techniques require the videos to be decoded to summarize them, which is a tedious job. Instead, compressed domain video processing can be used for summarizing videos by partially decoding them. A classification and analysis of various summarization techniques are pr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 42 publications
(17 citation statements)
references
References 116 publications
0
9
0
Order By: Relevance
“…The key to this approach is the vectorization of content because it matches based on the similarities in vectorized content. Examples of vectorization methods include video and description summarization [31,32] and image captioning [33][34][35]. In particular, the authors of [36] matched poetry and images through captioning, the authors of [33] presented an automatic caption generation method for Impressionist artworks for people with visual impairments, and the authors of [28] used emotional features in music recommendations.…”
Section: Sensing Technologymentioning
confidence: 99%
See 1 more Smart Citation
“…The key to this approach is the vectorization of content because it matches based on the similarities in vectorized content. Examples of vectorization methods include video and description summarization [31,32] and image captioning [33][34][35]. In particular, the authors of [36] matched poetry and images through captioning, the authors of [33] presented an automatic caption generation method for Impressionist artworks for people with visual impairments, and the authors of [28] used emotional features in music recommendations.…”
Section: Sensing Technologymentioning
confidence: 99%
“…The hyper-parameters of training were obtained via a greedy search. The ranges of the greedy searches were as follows: The batch sizes were [8,16,32], and the initial learning rates were [1 × 10 −4 , 2 × 10 −4 , . .…”
Section: Audio Feature Extraction Via the Multi-time-scale Transformmentioning
confidence: 99%
“…Kind of text summary (KTS): The most widely discussed distinction for text summarization works is the distinction of extractive vs abstractive. Similarly, depending on the nature of an output text summary, we can also classify the works in MMS tasks (containing text in the output) into extractive MMS [13,[44][45][46]58] and abstractive MMS [12,57,133,134] 6 .…”
Section: Content Intensity (Ci)mentioning
confidence: 99%
“…Although quite a few survey papers were written for uni-modal summarization tasks including surveys on text summarization [31,32,81,112,124] and video summarization [6,41,52,76,102], and a few survey papers covering multi-modal research [3,4,43,90,103,107]. However, to the best of our knowledge, we are the first to present a survey on multi-modal summarization.…”
Section: Introductionmentioning
confidence: 99%
“…The more recent study of Molino et al [8] focuses on egocentric video summarization and discusses the specifications and the challenges of this task. In another recent work, Basavarajaiah et al [9] provide a classification of various summarization approaches, including some recently-proposed deep-learning-based methods; however, it mainly focuses on summarization algorithms that are directly applicable on the compressed domain. Finally, another recent survey of Vivekraj et al [10] presents the relevant bibliography based on a twoway categorization, that relates to the utilized data modalities during the analysis and the incorporation of human aspects.…”
Section: Introductionmentioning
confidence: 99%