Proceedings of the British Machine Vision Conference 2010
DOI: 10.5244/c.24.95

Improving bag-of-features action recognition with non-local cues

Abstract: Local space-time features have recently shown promising results within Bag-of-Features (BoF) approach to action recognition in video. Pure local features and descriptors, however, provide only limited discriminative power implying ambiguity among features and sub-optimal classification performance. In this work, we propose to disambiguate local space-time features and to improve action recognition by integrating additional nonlocal cues with BoF representation. For this purpose, we decompose video into region …
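The abstract describes the standard BoF pipeline that the non-local cues extend: local space-time descriptors are quantized against a visual vocabulary, and histograms of visual words are computed per region channel rather than over the whole video, so region-level context enters the representation. The snippet below is a minimal illustrative sketch of that idea in Python, not the authors' implementation; the function names, the vocabulary size, the use of scipy's k-means, and the person/background channel example are assumptions made for illustration.

```python
import numpy as np
from scipy.cluster.vq import kmeans2, vq


def build_vocabulary(descriptors, k=4000):
    """Cluster local space-time descriptors (e.g., HOG/HOF) into k visual words."""
    centroids, _ = kmeans2(descriptors.astype(np.float64), k, minit='++')
    return centroids


def bof_histogram(descriptors, vocabulary):
    """Quantize descriptors against the vocabulary; return an L1-normalized word histogram."""
    words, _ = vq(descriptors.astype(np.float64), vocabulary)
    hist = np.bincount(words, minlength=len(vocabulary)).astype(np.float64)
    return hist / max(hist.sum(), 1.0)


def region_channel_bof(descriptors, region_labels, vocabulary, n_regions):
    """Concatenate one BoF histogram per region channel (e.g., person vs. background),
    so the final video representation also carries non-local, region-level context."""
    channels = []
    for r in range(n_regions):
        d = descriptors[region_labels == r]
        channels.append(bof_histogram(d, vocabulary) if len(d)
                        else np.zeros(len(vocabulary)))
    return np.concatenate(channels)
```

The resulting fixed-length vectors would then typically be classified with a non-linear kernel SVM, as is common in BoF action recognition.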

Cited by 92 publications (76 citation statements).
References 23 publications (35 reference statements).
“…Table 2 shows that our approach outperforms all other methods reported in literature so far. In particular, results are better than those of Ullah et al [7], who used a person detector, and Vig et al [14], who built upon saliency from an eye tracking system. A detailed analysis of the contribution of each set of features is presented in Table 3.…”
Section: Methods (mentioning)
Confidence: 45%
“…However, such failure cases are quite rare, and they are outnumbered by those cases where current person detectors fail. Hence, compared to [7], we achieve much better performance. We also compare to [14], where an eye tracking system was used to emphasize the part of the image that humans consider most important.…”
Section: Introduction (mentioning)
Confidence: 99%