Detecting Semantic Concepts from Video Using Temporal Gradients and Audio Classification

Rautiainen, Mika; Seppänen, Tapio; Penttilä, Jani; Peltola, Johannes

doi:10.1007/3-540-45113-7_26

Cited by 6 publications

(9 citation statements)

References 17 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In [6] we observed that several concepts coexist and correlate in a video, which is not suitable for multi-class classifiers. Our approach is to have several simplified concept detectors that are trained using small sets of positive example shots, each propagating labels to their nearest neighbors in selected feature spaces.…”

Section: Detecting Semantic Conceptsmentioning

confidence: 96%

“…TGC, initially used in the detector experiments in [6], describes spatial correlation of edge orientations in an autocorrelogram. The feature is computed from the 20 temporally sampled video frames in a shot.…”

Section: Low-level Featuresmentioning

confidence: 99%

“…However, training of multiple classifiers for large concept lexicon can be tedious. A fast and simple method to build concept detectors was introduced in [6], where detectors were trained by selecting only small sets of positive examples for every concept.…”

Section: Introductionmentioning

confidence: 99%

“…This paper presents extended experiments with visual detectors for 12 semantic concepts from TRECVID 2003 semantic feature detection task [13]. The detectors use low-level visual features that measure video motion activity and spatial correlations of image gradients and colors.…”

Section: Introductionmentioning

confidence: 99%

“…The detectors use low-level visual features that measure video motion activity and spatial correlations of image gradients and colors. In comparison to prior research on visual detectors [13] [6] this paper reports experiments with broader sets of concepts, low-level features, fusion techniques, training set sizes and larger test database. Section 2 describes selected low-level features and the fusion operations used in concept detectors.…”

Section: Introductionmentioning

confidence: 99%

See 4 more Smart Citations

Comparison of Visual Features and Fusion Techniques in Automatic Detection of Concepts from News Video

Rautiainen

Seppdnen

2005

2005 IEEE International Conference on Multimedia and Expo

Self Cite

View full text Add to dashboard Cite

This study describes experiments on automatic detection of semantic concepts, which are textual descriptions about the digital video content. The concepts can be further used in content-based categorization and access of digital video repositories. Temporal Gradient Correlograms, Temporal Color Correlograms and Motion Activity low-level features are extracted from the dynamic visual content of a video shot. Semantic concepts are detected with an expeditious method that is based on the selection of small positive example sets and computational low-level feature similarities between video shots. Detectors using several feature and fusion operator configurations are tested in 60-hour news video database from TRECVID 2003 benchmark. Results show that the feature fusion based on ranked lists gives better detection performance than fusion of normalized low-level feature spaces distances. Best performance was obtained by pre-validating the configurations of features and rank fusion operators. Results also show that minimum rank fusion of temporal color and structure provides comparable performance.

show abstract

Section: Detecting Semantic Conceptsmentioning

confidence: 96%

Section: Low-level Featuresmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Comparison of Visual Features and Fusion Techniques in Automatic Detection of Concepts from News Video

Rautiainen

Seppdnen

2005

2005 IEEE International Conference on Multimedia and Expo

Self Cite

View full text Add to dashboard Cite

show abstract

Cluster-temporal browsing of large news video databases

Rautiainen

Ojala

Seppänen

2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763)

View full text Add to dashboard Cite

This paper describes cluster-temporal browsing of news video database. Cluster-temporal browsing combines content similarities and temporal adjacency into single representation. Visual, conceptual and lexical features are used to organize and view similar shot content. Interactive experiments with eight test users have been carried out using a database of roughly 60 hours of news video. Results indicate improvemenfs in browsing efficiency when automatic speech recognition transcripts are incorporated into browsing by visual similarity. Cluster-temporal browsing application received positive comments from the test users and performed well in overall comparison of interactive video retrieval systems in TRECVID 2003 evaluation.

show abstract

Advancing Content-Based Retrieval Effectiveness with Cluster-Temporal Browsing in Multilingual Video Databases

Rautiainen¹,

Seppänen²,

Ojala³

2006

2006 IEEE International Conference on Multimedia and Expo

Self Cite

View full text Add to dashboard Cite

Interactive experiments on video retrieval systems need to address the problem of internal validity, i.e. how much the test users' experience affects the retrieval effectiveness. This paper compares the semantic retrieval performance of novice users and expert system developers. The test system utilizes cluster-temporal browsing, which combines chronological video structure and computation of similarities into single interface. Interactive experiments with eight test users were carried out in a database of ~80 hours of multilingual news video from TRECVID 2005 benchmark. A cluster-temporal browser was found to improve the retrieval effectiveness by 12% with novice system users. Expert users were able to achieve 18% better performance than the novice users. Additionally, manual search experiments demonstrated that search performance can be improved by 19-25% when a plain text search is supplemented with content-based features.

show abstract

Detecting Semantic Concepts from Video Using Temporal Gradients and Audio Classification

Cited by 6 publications

References 17 publications

Comparison of Visual Features and Fusion Techniques in Automatic Detection of Concepts from News Video

Comparison of Visual Features and Fusion Techniques in Automatic Detection of Concepts from News Video

Cluster-temporal browsing of large news video databases

Advancing Content-Based Retrieval Effectiveness with Cluster-Temporal Browsing in Multilingual Video Databases

Contact Info

Product

Resources

About