2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2011
DOI: 10.1109/icassp.2011.5946658
|View full text |Cite
|
Sign up to set email alerts
|

Motion-decision based spatiotemporal saliency for video sequences

Abstract: An adaptive spatiotemporal saliency algorithm for video attention detection using motion vector decision is proposed, motivated by the importance of motion information in video sequences for human visual system. This novel system can detect the saliency regions quickly by using only part of the classic saliency features in each iteration. Motion vectors calculated by block matching and optical flow are used to determine the decision condition. When significant motion contrast occurs (decision condition is sati… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2013
2013
2018
2018

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(2 citation statements)
references
References 9 publications
0
2
0
Order By: Relevance
“…The Motion Distribution is a significant feature as many previous works have indicated that commercial shots mostly have high motion content as they try to convey maximum information in minimum possible time. This motivates us to compute dense optical flow (Horn-Schunk formulation) between consecutive frames and construct a distribution of flow magnitudes over the entire shot with 40 uniformly divided bins in range of [0, 40] [5], [30]. Often pixel intensities of regions suddenly change while the boundaries of the region do not move.…”
Section: Audio-visual Featuresmentioning
confidence: 99%
See 1 more Smart Citation
“…The Motion Distribution is a significant feature as many previous works have indicated that commercial shots mostly have high motion content as they try to convey maximum information in minimum possible time. This motivates us to compute dense optical flow (Horn-Schunk formulation) between consecutive frames and construct a distribution of flow magnitudes over the entire shot with 40 uniformly divided bins in range of [0, 40] [5], [30]. Often pixel intensities of regions suddenly change while the boundaries of the region do not move.…”
Section: Audio-visual Featuresmentioning
confidence: 99%
“…We have used existing features from the literature viz. shot length [29], scene motion distribution [5], [30], overlay text distribution [8], zero crossing rate [31], [6], short time energy (STE) [6], fundamental frequency, spectral centroid, flux and roll-off frequency [8] and MFCC Bag of Words [32]. We observed that, SVMs trained on a certain set of features fail to detect the commercial shots when ever the basic assumption involving those features are violated.…”
Section: Introductionmentioning
confidence: 99%