High-Level Feature Detection from Video in TRECVid: A 5-Year Retrospective of Achievements

Smeaton, Alan F.; Over, Paul; Kraaij, Wessel

doi:10.1007/978-0-387-76569-3_6

Cited by 120 publications

(106 citation statements)

References 13 publications

Supporting

Mentioning

100

Contrasting

Unclassified

Order By: Relevance

“…This modern methodology facilitates an understanding of topic queries and low-level features by analysing the mapping in a semantic way. To build a largescale ontology and lexicon for semantic gap filling, large efforts have been made in activities like LSCOM (Large-Scale Concept Ontology for Multimedia), Naphade et al (2006); Kennedy and Hauptmann (2006), TRECVid, Smeaton et al (2009) and MediaMill's 101 concepts, Snoek et al (2006). Smeaton et al (2009) state that acceptable results have been achieved already within the TRECVid video retrieval evaluation framework for many cases particularly for concepts where there exists enough annotated training data.…”

Section: Annotating Lifelogs -Whatmentioning

confidence: 99%

“…To build a largescale ontology and lexicon for semantic gap filling, large efforts have been made in activities like LSCOM (Large-Scale Concept Ontology for Multimedia), Naphade et al (2006); Kennedy and Hauptmann (2006), TRECVid, Smeaton et al (2009) and MediaMill's 101 concepts, Snoek et al (2006). Smeaton et al (2009) state that acceptable results have been achieved already within the TRECVid video retrieval evaluation framework for many cases particularly for concepts where there exists enough annotated training data. Based on concept detection, encouraging improvement has been reported showing the efficiency and the effectiveness of concepts for higher level retrieval, Snoek et al (2006); Neo et al (2006).…”

Section: Annotating Lifelogs -Whatmentioning

confidence: 99%

See 1 more Smart Citation

LifeLogging: Personal Big Data

Gurrin

Smeaton

Doherty

2014

FNT in Information Retrieval

Self Cite

337

137

View full text Add to dashboard Cite

Section: Annotating Lifelogs -Whatmentioning

confidence: 99%

Section: Annotating Lifelogs -Whatmentioning

confidence: 99%

LifeLogging: Personal Big Data

Gurrin

Smeaton

Doherty

2014

FNT in Information Retrieval

Self Cite

337

137

View full text Add to dashboard Cite

“…High-level feature extraction results (average precision@2000 [6]) for both SIFT-based BoW features and features resulting from the concatenation of SIFT-and LIFT-based BoW are shown in Fig. 2.…”

Section: Resultsmentioning

confidence: 99%

“…Large scale video analysis for the purpose of high-level feature extraction, using local invariant features, is in most cases performed at the key-frame level [6]. Thus, the video analysis task reduces to still image analysis.…”

Section: Related Workmentioning

confidence: 99%

Local Invariant Feature Tracks for High-Level Video Feature Extraction

Mezaris

Dimou

Kompatsiaris

2012

Lecture Notes in Electrical Engineering

View full text Add to dashboard Cite

This paper builds upon previous work on local interest point detection and description to propose the extraction and representation of novel Local Invariant Feature Tracks (LIFT). These features compactly capture not only the spatial attributes of 2D local regions, as in SIFT and related techniques, but also their long-term trajectories in time. This and other desirable properties of LIFT allow the generation of Bags-of-Spatiotemporal-Words models that facilitate capturing the dynamics of video content, which is necessary for detecting high-level video features that by definition have a strong temporal dimension. Preliminary experimental evaluation and comparison of the proposed approach reveals promising results.

show abstract

“…This requires searching through lifelogs based on content, and for this the automatic detection of semantic concepts is needed. The conventional approach to content-based indexing, as taken in the annual TRECVid benchmarking [11,12], is to annotate a collection covering both positive and negative examples of the presence of each concept and then to train a machine learning classifier to recognize the presence of the concept. This typically requires a classifier for each concept without considering inter-concept relationships or dependencies yet in reality, many concept pairs will co-occur rather than occur independently.…”

Section: Introductionmentioning

confidence: 99%

Semantically Smoothed Refinement for Everyday Concept Indexing

Wang

Sun

Smeaton

2016

Lecture Notes in Computer Science

Self Cite

View full text Add to dashboard Cite

Abstract. Instead of occurring independently, semantic concepts pairs tend to co-occur within a single image and it is intuitive that concept detection accuracy for visual concepts can be enhanced if concept correlation can be leveraged in some way. In everyday concept detection for visual lifelogging using wearable cameras to automatically record everyday activities, the captured images usually have a diversity of concepts which challenges the performance of concept detection. In this paper a semantically smoothed refinement algorithm is proposed using concept correlations which exploit topic-related concept relationships, modeled externally in a user experiment rather than extracted from training data. Results for initial concept detection are factorized based on semantic smoothness and adjusted in compliance with the extracted concept correlations. Refinement performance is demonstrated in experiments to show the effectiveness of our algorithm and the extracted correlations.

show abstract

High-Level Feature Detection from Video in TRECVid: A 5-Year Retrospective of Achievements

Cited by 120 publications

References 13 publications

LifeLogging: Personal Big Data

LifeLogging: Personal Big Data

Local Invariant Feature Tracks for High-Level Video Feature Extraction

Semantically Smoothed Refinement for Everyday Concept Indexing

Contact Info

Product

Resources

About