Motion-decision based spatiotemporal saliency for video sequences

Zhu, Yaping; Jacobson, Natan; Hong, Pan; Nguyen, Truong Q.

doi:10.1109/icassp.2011.5946658

Cited by 6 publications

(2 citation statements)

References 9 publications

“…The Motion Distribution is a significant feature, as many previous works have indicated that commercial shots mostly have high motion content because they try to convey maximum information in the minimum possible time. This motivates us to compute dense optical flow (Horn-Schunck formulation) between consecutive frames and construct a distribution of flow magnitudes over the entire shot with 40 uniformly divided bins in the range [0, 40] [5], [30]. Often pixel intensities of regions suddenly change while the boundaries of the region do not move.…”

Section: Audio-visual Features (mentioning)

confidence: 99%
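
The motion-distribution feature quoted in the statement above (flow magnitudes binned into 40 uniform bins over [0, 40]) can be sketched as follows. This is a minimal illustration, not the citing authors' implementation: it assumes the dense flow fields have already been computed by some optical-flow routine, and the function name `flow_magnitude_histogram`, the normalization to unit sum, and the clipping of magnitudes above 40 into the last bin are all assumptions introduced here.

```python
import math


def flow_magnitude_histogram(flows, num_bins=40, max_mag=40.0):
    """Normalized histogram of optical-flow magnitudes over one shot.

    flows: iterable of per-frame-pair flow fields, each an iterable of
           (u, v) displacement tuples, one per pixel.
    """
    counts = [0] * num_bins
    bin_width = max_mag / num_bins
    total = 0
    for field in flows:
        for u, v in field:
            mag = math.hypot(u, v)
            # Assumption: magnitudes at or above max_mag fall into the last bin.
            idx = min(int(mag / bin_width), num_bins - 1)
            counts[idx] += 1
            total += 1
    if total == 0:
        # Empty shot: return an all-zero distribution.
        return [0.0] * num_bins
    # Assumption: normalize so the bins sum to 1 regardless of shot length.
    return [c / total for c in counts]
```

Normalizing by the pixel count keeps shots of different lengths comparable; whether the cited work normalizes in this way is not stated in the excerpt.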

“…We have used existing features from the literature, viz. shot length [29], scene motion distribution [5], [30], overlay text distribution [8], zero crossing rate [31], [6], short time energy (STE) [6], fundamental frequency, spectral centroid, flux and roll-off frequency [8] and MFCC Bag of Words [32]. We observed that SVMs trained on a certain set of features fail to detect the commercial shots whenever the basic assumptions involving those features are violated.…”

Section: Introduction (mentioning)

confidence: 99%

TV Commercial Detection Using Success Based Locally Weighted Kernel Combination

Kannao

Guha

2016

MultiMedia Modeling

Commercial detection in news broadcast videos involves judicious selection of meaningful audio-visual feature combinations and efficient classifiers. This problem becomes much simpler if these combinations can be learned from the data. To this end, we propose a Multiple Kernel Learning based method for boosting successful kernel functions while ignoring the irrelevant ones. We adopt an intermediate fusion approach in which an SVM is trained with a weighted linear combination of different kernel functions instead of a single kernel function. Each kernel function is characterized by a feature set and a kernel type. We identify the feature sub-space locations where a classifier trained only with a particular kernel function predicts successfully. For each kernel function, we propose to estimate a weighting function using support vector regression (with an RBF kernel) that takes high values (near 1.0) where the classifier learned on that kernel function succeeded and low values (near 0.0) otherwise. The second contribution of this work is a TV News Commercials Dataset of 150 hours of news videos. A classifier trained with our proposed scheme outperformed the baseline methods on 6 of 8 benchmark datasets and on our own TV commercials dataset.
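
The locally weighted kernel combination described in this abstract can be illustrated with a small sketch. The abstract does not give the exact formula, so the form below, K(x, y) = sum over m of w_m(x) K_m(x, y) w_m(y), is an assumption borrowed from common localized multiple kernel learning formulations. The names `rbf_kernel`, `linear_kernel` and `combined_kernel` are hypothetical, and the weighting functions are passed in as plain callables standing in for the support vector regressors the abstract mentions.

```python
import math


def rbf_kernel(x, y, gamma=0.5):
    """Gaussian RBF kernel between two feature vectors."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq_dist)


def linear_kernel(x, y):
    """Plain dot-product kernel."""
    return sum(a * b for a, b in zip(x, y))


def combined_kernel(x, y, kernels, weights):
    """Locally weighted sum of base kernels.

    Assumed form: K(x, y) = sum_m w_m(x) * K_m(x, y) * w_m(y), where
    w_m(.) is near 1.0 where base kernel m tends to succeed and near
    0.0 elsewhere. Multiplying by the weight at both points keeps the
    combined kernel symmetric.
    """
    return sum(w(x) * k(x, y) * w(y) for k, w in zip(kernels, weights))
```

With weights fixed at 1.0 for one kernel and 0.0 for the other, the combination reduces to that single base kernel, which is a quick sanity check on the weighting.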