2018
DOI: 10.1016/j.jvcir.2018.02.013
|View full text |Cite
|
Sign up to set email alerts
|

Making a long story short: A multi-importance fast-forwarding egocentric videos with the emphasis on relevant objects

Abstract: The emergence of low-cost high-quality personal wearable cameras combined with the increasing storage capacity of video-sharing websites have evoked a growing interest in first-person videos, since most videos are composed of longrunning unedited streams which are usually tedious and unpleasant to watch. State-of-the-art semantic fast-forward methods currently face the challenge of providing an adequate balance between smoothness in visual flow and the emphasis on the relevant parts. In this work, we present t… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
35
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
4
3

Relationship

2
5

Authors

Journals

citations
Cited by 14 publications
(35 citation statements)
references
References 22 publications
0
35
0
Order By: Relevance
“…In this section, we present the quantitative results of the experimental evaluation of the proposed method. We compare it with the methods: EgoSampling (ES) [22], Stabilized Semantic Fast-Forward (SSFF) [27], Microsoft Hyperlapse (MSH) [7] the state-of-the-art method in terms of visual smoothness, and Multi-Importance Fast-Forward (MIFF) [26] the state-of-the-art method in terms of the amount of semantics retained in the final video. Figure 4-a shows the results of the Semantic evaluation performed using the sequences in the Semantic Dataset.…”
Section: Comparison With State-of-the-art Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…In this section, we present the quantitative results of the experimental evaluation of the proposed method. We compare it with the methods: EgoSampling (ES) [22], Stabilized Semantic Fast-Forward (SSFF) [27], Microsoft Hyperlapse (MSH) [7] the state-of-the-art method in terms of visual smoothness, and Multi-Importance Fast-Forward (MIFF) [26] the state-of-the-art method in terms of the amount of semantics retained in the final video. Figure 4-a shows the results of the Semantic evaluation performed using the sequences in the Semantic Dataset.…”
Section: Comparison With State-of-the-art Methodsmentioning
confidence: 99%
“…The rates are calculated such that the semantic segments are played slower than the nonsemantic ones, and the whole video achieves the desired speed-up. We refer the reader to [26] for a more detailed description of the multi-importance semantic segmentation.…”
Section: Temporal Semantic Profile Segmentationmentioning
confidence: 99%
See 1 more Smart Citation
“…In this work, we proposed the CoolNet, a Convolutional Neural Network that learns the preference of the user from visual data of frames of YouTube videos in the YouTube8M dataset [29] and their statistics (number of views, likes, and dislikes). The readers is referred to our work [30] to details about the dataset creation, training routines, and model accuracy. The created semantic profile is used for segmenting the input video into sequences of different levels of semantic, and to compute speed-up rates such that it slows down the video portions according with their semantic load.…”
Section: A Temporal Semantic Profile Segmentationmentioning
confidence: 99%
“…The training process has used only restricted amount of task-specific training data. Jain et al [46,47] used CNN features for visual detection tasks, for example, object localization, scene identification, and classification. Alom et al [40] used cellular simultaneous recurrent networks (CSRNs) for feature extraction.…”
Section: Introductionmentioning
confidence: 99%