A Two-View Concept Correlation Based Video Annotation Refinement

Zhong, Cencen; Zhang, Miao

doi:10.1109/lsp.2012.2189386

Cited by 1 publication

(3 citation statements)

References 12 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Annotation refinement: accuracy of automatic concept detection suffers due to the high variability in the visual characteristics and, hence, it is a common practice to minimize the effect of the error by refining obtained annotations. Several works use a popular strategy called Content-Based Concept Fusion(CBCF) (Zhong & Miao, 2012) to achieve this. In its simplest form, CBCF uses some common concept coocurrence reference to evaluate the quality of candidate annotations within shots.…”

Section: Automatic Shot Annotation Requirementsmentioning

confidence: 99%

“…Therefore, approaches that use within-shot correlation as well as temporal correlation are adopted. For instance, authors of (Zhong & Miao, 2012) propose the following method. Given a shot x t | t = 1, .…”

Section: Automatic Shot Annotation Requirementsmentioning

confidence: 99%

“…To refine the shot annotations resulting from the classification, we use a modified version of the strategy prposed by authors of (Zhong & Miao, 2012). Temporal refinement term is calculated over a window of a 10 shots and spatial refinement is done using WordNet and DBPedia based voting.…”

Section: Automatic Shot Annotationmentioning

confidence: 99%

See 2 more Smart Citations

Fine‐granularity semantic video annotation

El-Khoury

Jergler

Bayou

et al. 2013

International Journal of Pervasive Computing and Communications

View full text Add to dashboard Cite

A fine-grained video content indexing, retrieval, and adaptation requires accurate metadata describing its structure and semantics to the lowest granularity, i.e., the object level. We address these requirements by proposing Semantic Video Content Annotation Tool (SVCAT) for structural and high-level semantic annotation. SVCAT is a semi-automatic MPEG-7 standard compliant annotation tool, which produces metadata according to a new object-based video content model. Videos are temporally segmented into shots and shots level concepts are detected automatically using ImageNet as a background knowledge. These concepts are used as a guide to easily locate and select objects of interest which can be tracked automatically. The integration of shot based concept detection with object localization and tracking drastically alleviates the task of an annotator. As such, SVCAT enables to easily generate selective and fine-grained metadata which are vital for user centric object level semantic video operations such as product placement or obscene material removal. Experimental results show that SV-CAT is able to provide accurate object level video metadata.

show abstract

Section: Automatic Shot Annotation Requirementsmentioning

confidence: 99%

Section: Automatic Shot Annotation Requirementsmentioning

confidence: 99%