The platform will undergo maintenance on Sep 14 at about 7:45 AM EST and will be unavailable for approximately 2 hours.
2010
DOI: 10.1007/s13173-010-0009-z
|View full text |Cite
|
Sign up to set email alerts
|

Comparative evaluation of static gesture recognition techniques based on nearest neighbor, neural networks and support vector machines

Abstract: It is a common behavior for human beings to use gestures as a means of expression, as a complement to speaking, or as a self-contained communication mode. In the field of Human-Computer Interaction, this behavior can be adopted to build alternative interfaces, aiming to ease the relationship between the human element and the computational element. Currently, various gesture recognition techniques are described in the technical literature; however, the validation studies of these techniques are usually performe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2011
2011
2021
2021

Publication Types

Select...
2
2
1

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 23 publications
0
3
0
Order By: Relevance
“…Since SVMs [29] have gained much attention in recent times due to their powerful generalization capabilities as gesture classifiers [16], [18] we evaluate different feature learning schemes using SVMs. The following approaches are evaluated in this paper using our dataset: (i) The authors in [30], [31], [32] use Hu Invariant Moments for feature learning from images of different objects and gestures; (ii) Unsupervised feature learning is applied by authors in [33] using the Spatial Pyramid (generally referred to as Bag of Features or Bag of Words (BoW)) a combination of SIFT and k-means; (iii) Shape properties of objects such as roundness, form factor, compactness, eccentricity, perimeter, solidity etc are used by the authors in [31], [34]; (iv) Skeletonization has been proposed by the authors in [35], [36] for gesture recognition tasks, such as the counting the number of fingers; (v) Pyramid of Histogram Oriented Gradients (PHOG) [37], a variant of the famous HOG descriptor [38], gained popularity for its vectorized HOG feature learning approach; (vi) The Fast Fourier Transform (FFT) has been used by the authors in [39] to represent the shape of the hand contour in images using the spatial domain; (vii) CNNs called Tiled CNNs [40] are supervised feature learners and classifiers able to learn complex invariances such as scale and rotational invariance.…”
Section: A Existing Approachesmentioning
confidence: 99%
See 1 more Smart Citation
“…Since SVMs [29] have gained much attention in recent times due to their powerful generalization capabilities as gesture classifiers [16], [18] we evaluate different feature learning schemes using SVMs. The following approaches are evaluated in this paper using our dataset: (i) The authors in [30], [31], [32] use Hu Invariant Moments for feature learning from images of different objects and gestures; (ii) Unsupervised feature learning is applied by authors in [33] using the Spatial Pyramid (generally referred to as Bag of Features or Bag of Words (BoW)) a combination of SIFT and k-means; (iii) Shape properties of objects such as roundness, form factor, compactness, eccentricity, perimeter, solidity etc are used by the authors in [31], [34]; (iv) Skeletonization has been proposed by the authors in [35], [36] for gesture recognition tasks, such as the counting the number of fingers; (v) Pyramid of Histogram Oriented Gradients (PHOG) [37], a variant of the famous HOG descriptor [38], gained popularity for its vectorized HOG feature learning approach; (vi) The Fast Fourier Transform (FFT) has been used by the authors in [39] to represent the shape of the hand contour in images using the spatial domain; (vii) CNNs called Tiled CNNs [40] are supervised feature learners and classifiers able to learn complex invariances such as scale and rotational invariance.…”
Section: A Existing Approachesmentioning
confidence: 99%
“…Recently, different research efforts on 2D appearance model-based methods for gesture recognition have emerged [9], [10], [11], [12], [13], [14], [15], amongst which supervised and unsupervised learning techniques such as Neural Networks (NNs), Support Vector Machine (SVMs) and NearestNeighbor [16], [17], [18] classifiers have gained familiarity. However, feature learning is not a part of such classification schemes and needs to be performed separately to compute features such as edges, gradients, pixel intensities and object shape.…”
Section: Introductionmentioning
confidence: 99%
“…But the space of HAR is limited into a small part of the environment, where the HAR sensor has been placed. Obviously, absolute anchoring approaches are principally vision-based methods and can be further categorized by the function of modeling variations in time: direct classification, which classifies image features without using information about the time factor and HAR is usually performed directly for each frame individually such as in [12], [13], [14] and [15]. Furthermore, temporal state-space methods where temporal data appears as a particular dimension and where every observation is equivalent to an image representation in given a time such as in [16], [17], [18] and [19].…”
Section: Related Workmentioning
confidence: 99%