2017
DOI: 10.1016/j.imavis.2016.06.001
|View full text |Cite
|
Sign up to set email alerts
|

Comparing random forest approaches to segmenting and classifying gestures

Abstract: A complete gesture recognition system should localize and classify each gesture from a given gesture vocabulary, within a continuous video stream. In this work, we compare two approaches: a method that performs the tasks of temporal segmentation and classification simultaneously with another that performs the tasks sequentially. The first method trains a single random forest model to recognize gestures from a given vocabulary, as presented in a training dataset of video plus 3D body joint locations, as well as… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0
1

Year Published

2017
2017
2023
2023

Publication Types

Select...
4
4
2

Relationship

0
10

Authors

Journals

citations
Cited by 52 publications
(12 citation statements)
references
References 26 publications
0
11
0
1
Order By: Relevance
“…The main process of random forest is constructing the set of decision trees as in Fig. 14 during the training phase and finally output the class in the case of classification or the prediction in the case of regression [47].…”
Section: Random Forest Classifiermentioning
confidence: 99%
“…The main process of random forest is constructing the set of decision trees as in Fig. 14 during the training phase and finally output the class in the case of classification or the prediction in the case of regression [47].…”
Section: Random Forest Classifiermentioning
confidence: 99%
“…No entanto, além de não ser destinada ao problema do HRI, a sua aquisição foi realizada utilizando o sensor Microsoft Kinect 360. Assim, os dados de natureza multimodal (cor, profundidade, esqueleto,áudio e máscara de usuário) fazem com que os principais trabalhos, como (Li et al, 2017), (Joshi et al, 2017), (Efthimiou et al, 2016), (Neverova et al, 2016), concentrem-se no uso da multimodalidade dos dados, quase que descartando a possibilidade do uso apenas de informação de cor. Isso, como discutido antes, pode condicionar seu uso apenas em ambientes possuidores de tais sensores, o que não acorre na maioria dos ambientes, onde as câmeras RGB são facilmente encontradas.…”
Section: Trabalhos Relacionadosunclassified
“…For feature encoding, the representative methods include bag of visual words (BoVW) [20], vector of locally aggregated descriptors (VLAD) [22] and fisher vector (FV) [7]. For the decision-making stage, the popular classifiers applied to the datasets in the Table 2 include KNN [20,22], SVM [12,7] and random forest [31]. Table 3 summarises the attributes of the human pose estimation dataset used for evaluation by some of the work presented in this special issue.…”
Section: The State-of-the-art In Human Motion Analysismentioning
confidence: 99%