2016
DOI: 10.1016/j.neucom.2015.09.074
|View full text |Cite
|
Sign up to set email alerts
|

A novel hierarchical Bag-of-Words model for compact action representation

Abstract: Bag-of-Words (BoW) histogram of local space-time features is very popular for action representation due to its high compactness and robustness. However, its discriminant ability is limited since it only depends on the occurrence statistics of local features. Alternative models such as Vector of Locally Aggregated Descriptors (VLAD) and Fisher Vectors (FV) include more information by aggregating high-dimensional residual vectors, but they suffer from the problem of high dimensionality for final representation. … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 15 publications
(3 citation statements)
references
References 46 publications
0
3
0
Order By: Relevance
“…The vector is compact and is invariant to translation, rotation, and scaling [21]. However, where the codebook size is very large, the dimension of the image vector can also become very large [112]. This will call for the inclusion of dimension reduction methods; thereby achieve better retrieval results than BOVW at a high computational cost [113].…”
Section: Image Representationsmentioning
confidence: 99%
“…The vector is compact and is invariant to translation, rotation, and scaling [21]. However, where the codebook size is very large, the dimension of the image vector can also become very large [112]. This will call for the inclusion of dimension reduction methods; thereby achieve better retrieval results than BOVW at a high computational cost [113].…”
Section: Image Representationsmentioning
confidence: 99%
“…Containing the term will be represented as '1', while the absence will be represented as '0'. Now each data instance will be represented as a vector like the following vector: 00001000000 in this regard, each vector will contains a value of '1' which refers to the occurrence of a corresponding term [20,21].…”
Section: N-gram Representationmentioning
confidence: 99%
“…Bag-of-Words (BoW) is a classical mid-level feature representation framework which has demonstrated excellent performances in computer vision tasks such as image classification [13], [14] and action classification [15]- [17]. However, the traditional spatial pyramid widely used in BoW [13], [14] does not consider body structure information which is important prior information for person re-identification.…”
Section: Introductionmentioning
confidence: 99%